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Preface 


The primary purpose of this book is to provide a rigorous and succinct de- 
scription of the mathematical methods and tools about the foundations of 
representing fuzzy information and reasoning with it within Semantic Web 
Languages. As such it may also be seen as a compendium on the matter. 

The development covers the three main streams of Semantic Web lan- 
guages: namely, triple languages RDF & RDFS, conceptual languages OWL, 
OWL 2 and their profiles (OWL EL, OWL QL and OWL RL), and rule-based 
languages such as RIF. 

No previous knowledge of fuzzy set theory or fuzzy logic is required for an 
understanding of the material in this text, although I assume that the reader is 
familiar with the basic notions of classical (non fuzzy) set theory and classical 
(two-valued) logic. 

All the theoretical and logical aspects of classical (two-valued) Semantic 
Web Languages are covered in the first six chapters, which are designated with 
Part I. Part II is then devoted to generalizing these languages to cope with 
fuzzy set theory and fuzzy logic and covers the remaining five chapters. 

The two parts together may help both the non-fuzzy set theory and fuzzy 
logic expert to get an insight into Fuzzy Semantic Web Languages, while on the 
other hand may help the non Semantic Web Languages expert to get a better 
understanding of the theoretical fundamentals of Semantic Web Languages. 

The book also contains an extensive bibliography, which covers virtually 
all relevant books and papers published by 2012, which may help the younger 
readers in keeping track of previous work. 


Umberto Straccia 
February 16th, 2013 
Pisa, Italy 


xi 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


List of Figures 


1.1 
1.2 
1.3 
1.4 
1.5 


2.1 
2.2 


3.1 


4.1 
4.2 
4.3 
4.4 
4.5 
4.6 
4.7 


8.1 


8.2 
8.3 
8.4 
8.5 
8.6 
8.7 
8.8 


10.1 
10.2 


10.3 
10.4 
10.5 


Distributed information retrieval scenario. .......... 4 
Matchmaking. =... sole ts oa ca he oy a ea 5 
Fuzzy numbers... Ne at ai d Ae ek e be a A le ek da 6 
Excerpt of an ontology about interesting sites in a city. . . . 7 
Very low, low, fair, high, and very high hotel prices... .. . 8 
SWLs from a logical perspective. ..... o... ...... 12 
OWE familys so EA AE EG 14 
Triples:as:praph: y sus wavs ai na ate gia it d 18 
Excerpt of the DL-Lite family... ............... 43 
Clash-free complete completion-forest from ABox...... . 50 
Clash-free complete completion-forest for acyclic KB. ... . 52 
Blocking in ALC: mera ela be să 53 
Clash-free completion-forest from ALC KB. ......... 55 
A DL classification run. o... .. o... 59 
An EL** classification example................. 65 


(a) Trapezoidal function trz(a, b, c,d); (b) Triangular function 
tri(a, b,c); (c) L-function Is(a,b); and (d) R-function rs(a,b). 106 


Fuzzy sets over salaries using trapezoidal functions. .... . 106 
Fuzzy sets over salaries using triangular functions... .... 107 
Graphical view of fuzzy set complement operation. ..... 108 
Graphical view of fuzzy set intersection operation... .... 108 
Graphical view of fuzzy set union operation. ......... 108 
Some examples of fuzzy quantifiers... ............ 123 
Linear modifier Im(a,b). .. 0... ee 127 
Annotation property defining concept BuyerPreferences. . . . 191 
Annotation property defining fuzzy datatype 1s(22000, 

DAA e tas Edo et as 192 
Menu options of the Fuzzy OWL 2 plug-in............ 193 
Creation of a fuzzy datatype with the Fuzzy OWL 2 plug-in. 194 
Clash-free complete completion-forest from fuzzy ABox. . . 206 


xiii 


xiv 


List of Figures 


Clash-free and complete completion-forest from acyclic fuzzy 


KBeusd a is o at A Aa 207 
Clash-free and complete completion-forest from fuzzy 

ALCOR Baoa t Fe Aaa ea RS LA bt A Me Boa 209 
OR-based complete completion-forest from fuzzy ALC KB.. 217 
A fuzzy DL classification run. .. cc. 224 
A fuzzy EL** classification example. ............. 228 
Pairwise blocking in SHZF. ......o.o.. ooo... oo. 281 


Clash-free complete completion-forest for SHTF KB... .. 283 


List of Tables 


Semantic conditions for classical pdf interpretations. ... . 20 
Inference rules for pdf. . o... a 24 
Syntax and semantics of the DL SROZO(D). ........ 40 
Syntax ot ELFE (D) AI OO 42 
The tableau rules for ALC with empty TBox. ........ 49 
The tableau rules to deal with acyclic TBox.......... 51 
The tableau rules for ALC KBs with GCIs. ......... 54 
NNF transformation rules for E£** TBoxes.. ........ 62 
Completion rules for ELTE. l.. ta alee eee Bo 64 


The complexity of various reasoning task for the OWL family. 71 


Query driven computation related to Example 20....... 86 
Fundamental properties of crisp set operations. ....... 103 
Axioms for t-norms and s-norms. ...........-.... 109 
Axioms for negation functions... ............... 110 
Some properties of t-norms. .. ooo a 116 
Properties for implication and negation functions... .... 117 
Some additional properties of combination functions of various 

DOTA hs ee a ee aS Sy ae A a a e Ge he a 120 
Combination functions of various fuzzy logics. ........ 129 


Mapping of propositional SFL statements to crisp statements. 151 
Mapping of propositional Lukasiewicz logic statements to 


crisp statements. ...........-.. 00002520008 152 
Mapping of matrix-based fuzzy propositional statements to 

crisp statements. .......... 0... 0000000004 152 
Fragment of a P(D) fuzzy KB example for fuzzy control... 161 
Inference rules for fuzzy pdf. . . cc... 170 
Syntax and semantics of the fuzzy DL SROTQ........ 177 
Some additional fuzzy concept equivalences. ......... 179 
From fuzzy SROTQ to fuzzy SROTQ(D)........... 185 
Crisp mapping of concept and role expressions. ....... 201 
Crisp reduction of the fuzzy DL axioms. ..........., 201 


XV 


xvi 


10.6 
10.7 
10.8 
10.9 
10.10 
10.11 


10.12 
10.13 
10.14 


A.l 
A.2 
A.3 


C.1 


H.1 
H.2 
H.3 


List of Tables 


The tableau rules for fuzzy ALC with empty TBox...... 
The tableau rules for fuzzy ALC with acyclic fuzzy KB. 

The tableau rules for fuzzy ALC with GCIs. ......... 
The tableau rules for finitely valued fuzzy ALC with GCIs. . 
The OR-based tableau rules for fuzzy ALC with empty TBox. 
The OR-based tableau rules for fuzzy ALC with acyclic fuzzy 
KBS edt Tee ok Dlg ot howe ok nd ca d ee ony 1 a E a 
The OR-based tableau rules for fuzzy ALC with GCIs. .. . 
Normalization rules for fuzzy EL? TBoxes........... 
Completion rules for fuzzy EL*.. cc... 


RDFS vocabulary. . ............. 2020000 ee 
RDFS axiomatic triples. . . ooa 
Semantic conditions for classical RDFS interpretations. . . . 


The tableau rules for SHZFg.. cc... 


The tableau rules for fuzzy SHIF g. <<... o... .-. 
The tableau rules for finitely valued fuzzy SHTF,. ..... 
The OR-based tableau rules for fuzzy SHZF, with GCIs. 


Chapter 1 


The Quest for Fuzzy Logic in 
Semantic Web Languages 


Semantic Web Languages (SWLs), such as triple languages RDF & RDES [77] 
(Resource Description Framework), conceptual languages of the OWL 2 family 
(Ontology Web Language) [340] and rule languages of the RIF (Rule Inter- 
change Format) family [365], are languages to provide a formal description of 
concepts, terms, and relationships within a given knowledge domain and to 
be used as well to write the metadata that typically annotates any kind of 
web-data. 

A large body of work has been carried out addressing various aspects, such 
as computational complexity, reasoning algorithms, and scalability. Moreover, 
the standardization of languages such as RDFS and OWL 2, together with 
the emergence of various implemented tools, allows us to access in a computer 
processable and uniform way, large bodies of general purpose and/or domain 
depended knowledge spread over the Web, that can be used, e.g., to infer new 
knowledge (which may be injected back on the Web). 

However, the restriction of SWLs to classical, two-valued /bivalent logic 
has limitations related to its inability to semantically cope with the inherent 
“imperfection” of web-data. That is, the inability to deal with e.g., 


Inconsistency: One may easily find, e.g., different Gross Domestic Product 
(GDP) values for a specific country such as illustrated by Wikipedia’, 
in which e.g., the GDP according to the United Nations? and the Inter- 
national Monetary Fund? differ. 


Trust: Software agents may gather pieces of data from various information 
sources on the Web to collect them together and/or infer new knowledge. 
As we filter information that we think wasn’t derived in a scientifically 
viable way and we have the ability to trust and distrust sources, so 
should software agents as well. The missing indication of the degree of 
trustworthiness of the sources and the inferred knowledge may lead to 
the propagation of “unreliable” information. 


Non-monotonicity: The major problem that monotonic knowledge bases 


Inttp://en.wikipedia.org/wiki/List_of_countries_by_GDP_ (nominal) 
2http://www.un.org 
Shttp://www.imf .org/external/index.htm 
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face is that of inconsistency. Here it is assumed that if a true statement 
s can be derived from a set K of statements, then s can also be derived 
from every larger set K’ that contains K. This seems an appealing as- 
sumption to make, because it allows reasoning to be local and to only 
take into account the rules and facts that are immediately necessary to 
infer a new statement. But it is also an unrealistic assumption to make, 
because the world, even the formalized one, is full of contradictions and, 
typically, the increase of information at hand and assumptions made 
previously may become invalid. For instance, from the statement “usu- 
ally Pm in office during office time” one may infer that “I will be at 
the office Monday morning.” However, if later on I become aware of the 
fact that Monday I will be travelling, the inferred statement becomes 
invalid. 


Uncertainty: A severe issue related to SWLs is the inability to deal with the 
inherent uncertainty in the gathered data. With uncertainty we mean 
the scenario in which any statement is true or false, but due to the 
limited knowledge we have access to, we are unable to exactly establish 
whether the statement is either true or false. 


For instance, the GDP is the market value of all officially recognized 
final goods and services produced within a country in a given period 
and the GDP per capita is often considered an indicator of a country's 
standard of living. Now, one method to determine the GDP is the sum 
of private consumption, gross investment, government spending, and the 
difference among exports and imports, i.e., 


GDP=C+I+G+(X—M). 


Now, very likely the values of C,I,G,X and M cannot be gathered 
(by humans and/or software agents) exactly and, thus, can only be 
approximated. We may also provide an estimation of the probability 
of e.g., being the value of C the approximated value vc. That is, we 
may further provide the measure Pr(C = vc). Therefore, the exact 
value of the GDP is approximated with vapp as well and accompanied 
with the probability of the statement s “the GDP is vapp” being true, 
i.e., p = Pr(GDP = vapp). The point now is that the correct anno- 
tation of the statement “the GDP is vapp” with the probability p of 
being this statement true cannot be semantically done properly in cur- 
rent SWLs. This may cause a loss of information once these values are 
put back on the Web and further processed automatically by a software 
agent to determine, e.g., an indicator of a country’s standard of living. 
In fact, we may become unaware again of the degree of reliability on the 
inferred information. 


Fuzziness: Natural language in general, and web-data specifically, are per- 
vaded with concepts that are vague, specifically fuzzy, in the sense that 
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statements, such as “the indicator of a country’s standard of living is 
low”, cannot always to be determined to be either true or false because 
it is unclear how to define exactly the involved term “low GDP”. Ina 
similar fashion, we face difficulties to figure out the answers to a request 
such as “find me the countries with a low GDP” (as an exercise, try 
to give an answer to the query for countries with low GDP listed in 
Wikipedia, say, according to the UN ranking). Concepts such as, low, 
high, warm, cold, dry, humid, etc., are called fuzzy concepts and rely 
on fuzzy set theory. That is, while in classical set theory a set has crisp 
boundaries, i.e., an object is either a member of a set S or it is not, 
in fuzzy set theory instead, an object may be member of a set to some 
more fine grained degree; usually a degree taken from the unit interval 
(0, 1]. Therefore, statements and answers are graded. 


Incorporating all the above dimensions has not yet been worked out, even 
though there has been extensive research on each of them. 


In this book, we will focus on fuzziness only, which may address 
some interesting application scenarios as succinctly illustrated be- 
low. 


Ontology-based Information Retrieval. In Ontology-based Information 
Retrieval (OBIR), one may determine the degree to which a web site, a Web 
page, a text passage, an image region, a video segment, database records, 
etc., is relevant to an information need expressed using a domain specific on- 
tology. In a fuzzy setting the notion of “relevance” or “aboutness” is indeed 
context dependent and subjective. That is, the notions of relevance and about- 
ness are fuzzy concepts and specific similarity functions are defined to imple- 
ment such notions. Here the top-k retrieval problem, i.e., the retrieval of the 
top-k ranked answers, where each answer is ranked according to the degree to 
which it satisfies the query, becomes an important one (see also e.g., [65, 126, 
320, 344, 386, 392, 404, 409, 410, 414, 417, 420, 421, 434, 435]). An illustrative 
example query may be: “find top-k cheapest hotels close to the train station”. 

A more general case consists of the so-called Distributed Ontology-based 
Information Retrieval (DOBIR) scenario, as depicted in Figure 1.1 (see [433]). 

In DIR, a software agent has to perform automatically the following steps: 


1. The agent has to select a subset of relevant resources S’ C S, as it is 
not reasonable to access and to query all resources (using techniques of 
resource selection/resource discovery); 


2. For every selected source S; € S’ the agent has to reformulate its infor- 
mation need Q4 into the query language £; provided by the resource 
(using techniques of schema mapping/ontology alignment); 


3. The relevant results from the selected resources have to be merged to- 
gether (using techniques of data fusion/rank aggregation). 
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Query Q 


FIGURE 1.1: Distributed information retrieval scenario. 


Concerning point 1, both the determination of the relevant source involves 
fuzziness as well the representation of the score of the degree of relevance of 
each source associated to a query. For point 2, a schema mapping can roughly 
be seen as a graded relation about the relatedness among the terms of the 
query and the ones of the sources’ vocabulary, while in point 3 one needs to 
represented the score of the individual items that have been retrieved. 


Ontology-based Matchmaking. In Ontology-based Matchmaking typically 
a buyer specifies his graded preferences over the product he wants to buy, while 
on the other hand sellers specifies theirs. The objective of the software agent 
here is to find the best possible agreement (called Pareto optimal solution 
or Nash equilibrium) between a buyer and the sellers (see e.g., [48, 355, 356, 
357, 358, 359, 360, 99, 430, 431, 432]). An excerpt may be the following (see 
also Figure 1.2). A car seller sells the car Audi TT for € 31500, as from the 
catalogue price. A buyer is looking for a sports car, but wants to pay no 
more than around € 30000. In classical set theory the problem relies on the 
crisp conditions on price. A more fine grained approach consists of considering 
prices as vague constraints, ¿.e., fuzzy sets, as usual in negotiation (moreover, 
the notion of a sports car is vague as well); 


1. The seller may sell above € 31500, but can go down to € 30500; 
2. The buyer prefers to spend less than € 30000, but can go up to € 32000; 
So the highest degree of matching is 0.75 and the car may be sold at € 31250. 


Ontology-based Multi-Criteria Decision Making. Multi-Criteria De- 
cision Making (MCDM) is among one of the most well known branches of 
decision-making. Roughly, MCDM is the study of identifying and choosing 
alternatives based on the values and preferences of the decision maker. Mak- 
ing a decision implies that there are alternative choices to be considered and 
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Buyer Seller 
30000 31500 


0 30500 32000 
31250 


FIGURE 1.2: Matchmaking. 


to choose the one that best fits our goals, objectives, desires, values, and so 
on [445] (see also [418]). 

Usually, alternatives represent different choices of action available to the 
decision maker and is assumed to be finite in our case. The decision criteria 
represent the different dimensions from which the alternatives can be viewed 
(decision criteria are also referred to as goals or attributes). Most of the MCDM 
methods require the criteria to be assigned decision weights of importance. 
Usually, these weights are normalized to add up to one. 

A standard feature of MCDM methods is that a MCDM problem can be 
expressed by means of a decision matrix, as shown below 


Criteria 
wi wa Wm 
Alternatives | C1 | Ca . . Cy, 
$1 A Q11 | 412 : i Alm (1 1) 
$2 A» a21 | G22 i i 02m 
Sn An an1 An2 J y Anm 


In the matrix each column belongs to a criterion C and each row describes 
the performance of an alternative A;. The value a;; describes the performance 
of alternative A; against criterion C;. Weight w; reflects the relative impor- 
tance of criteria Cj to the decision. The weights of the criteria are usually 
determined on a subjective basis and may also be seen as a kind of profit 
of the criteria. They may represent the opinion of a single decision maker or 
synthesize the opinions of a group of experts. Eventually, s; is the overall score 
of alternative A; computed using some aggregation method using the weights 
wij and performance values as; (1 < j < m). 
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FIGURE 1.3: Fuzzy numbers. 


Here, fuzziness arises naturally in the description of the criteria, offers, the 
relative importance of the criteria and the performance of each alternative 
against each criteria (see, e.g., the area of fuzzy MCDM [225, 443)). 

For instance, assume that we have to decide which offer to choose for the 
development of a public school (see matrix representation below): 


Offer Cost DeliveryTime | Quality 
Fair Low High 
A | VeryPoor Fair Good 
Az Good VeryGood Poor 
A3 Fair Fair Poor 


e There are three offers A; (alternatives), described using the terms of 
an ontology, which have been evaluated manually or semi-automatically 
by one or more experts, or fully automatically by one or more software 
agents, according to three criteria 


— Cost, Delivery Time, Quality 


e The performance values of the alternatives w.r.t. a criteria are also 
vague, i.e., are expressed in terms of qualitative degrees (e.g., fuzzy 
numbers [242], see also Figure 1.3) such as 


— Very Poor, Poor, Fair, Good, Very Good 


e The relative importance of each criteria is also expressed using fuzzy 
numbers such as 


— Very Low, Low, Fair, High, Very High 


Then a key aspect concerns the problem of determining the score of each 
alternative and computing the top-k ranked alternatives. 


Ontology-based Data Mining. In Ontology-based Data Mining (OBDM), 
the goal is to discover structured knowledge from an ontology-based and 
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FIGURE 1.4: Excerpt of an ontology about interesting sites in a city. 


usually large data set, where the ontology represents the background infor- 
mation on the domain. Here, fuzzy approaches play an important role, be- 
cause they may provide human comprehensible, i.e., better interpretable, re- 
sults [246, 260, 261]. 

For instance, consider the case of hotel finding in a possible tourism ap- 
plication, where an ontology is used to describe the meaningful entities of the 
domain* (see also Figure 1.4). 

Now, one may fix a city, say Pisa, extract the characteristic of the hotels 
from web sites and the graded hotel judgements of the users e.g., from Trip 
Advisor? and ask about what characterizes good hotels. Then one may learn 
that, e.g., that “a hotel having a high price is a good hotel [260, 261)”. In 
this case, the notion of high price has been determined automatically form 
the data (see Figure 1.5). 


4nttp: //donghee. info/research/SHSS/0bjectiveConceptsOntology (0CO) .html 
Shttp://www.tripadvisor.com 
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VL=Very Low L=Low F=Fair 


H= High VH = Very High 


FIGURE 1.5: Very low, low, fair, high, and very high hotel prices. 


We refer the interested reader to the FOIL-DL system®, for further 
insights. 


Ontology-based Geographic Information System. Geographic Informa- 
tion Systems (GISS) are widely used nowadays. It is quite obvious, how- 
ever, that many spatial features often do not have clearly defined boundaries, 
and concepts like moderate slope, strong elevation, close to a lake, near to 
a major road, located in a dry region, etc., can better be expressed with 
degrees of membership to a fuzzy set than with a binary yes/no classifica- 
tion [226, 419, 438]. 

The above mentioned areas, besides many more, may hopefully convince 
the reader that fuzzy set theory and fuzzy logic may contribute positively to 
make knowledge, expressed via SW Ls, besides being mechanically processable, 
more suitable for human reading and information interchange. 


Shttp://nmis.isti.cnr.it/-straccia/software/FOIL-DL/index.html 
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Chapter 2 


Introduction 


The Semantic Web is a “web of data” whose goal is to enable machines to 
understand the semantics, or meaning, of information on the World Wide Web. 
In rough terms, it should extend the network of hyperlinked human-readable 
Web pages by inserting machine-readable metadata! about pages and how 
they are related to each other, enabling automated agents to access the Web 
more intelligently and perform tasks on behalf of users. 

Semantic Web Languages (SWLs) are the languages used to provide a for- 
mal description of concepts, terms, and relationships within a given knowledge 
domain to be used to write the metadata. There are essentially three family 
of languages: namely, 


e Triple languages RDF & RDES [77] (Resource Description Framework); 


e Conceptual languages of the OWL 2 family (Ontology Web Lan- 
guage) [340]; and 


e Rule languages of the RIF family (Rule Interchange Format) [365]. 


While their syntactic specification is based on XML [463], their semantics is 
based on logical formalisms (see Fig. 2.1): briefly, 


e RDFS is a logic having intensional semantics and the logical counterpart 
is pdf [325]; 


e OWL 2 is a family of languages that relate to Description Logics 
(DLs) [19]; 


e RIF relates to the Logic Programming (LP) paradigm [263]. 


Both OWL 2 and RIF have an extensional semantics. 

Of course, having standard languages to represent and reason about do- 
main knowledge is of little use if we cannot appropriately query it. To this 
purpose, the query language SPARQL has been defined [380, 381], and con- 
sidered as one of the key technologies of the Semantic Web. 


1Obtained manually, semi-automatically, or automatically. 
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Class/Concept 
Language 


FIGURE 2.1: SWLs from a logical perspective. 


2.1 RDF  RDFS 


The basic ingredients of RDF are triples of the form (s,p,o), such as 
(umberto, likes, tomato), stating that subject s has property p with value o. 
In RDF Schema (RDFS), which is an extension of RDF, additionally some 
special keywords may be used as properties to further improve the expressivity 
of the language. For instance we may also express that the class of tomatoes 
are a subclass of the class of vegetables (tomato, sc, vegetable), while Zurich 
is an instance of the class of cities (zurich, type, city). 

From a computational point of view, one computes the so-called closure 
(denoted cl(K)) of a set of triples K. That is, one infers all possible triples 
using inference rules [307, 325, 362], such as 


(A, sc, B), (X, type, A) 
(X, type, B) 


“if A subclass of B and X instance of A then infer that X is 
instance of B”, 


and then store all inferred triples into a relational database to be used then for 
querying. We recall also that there are several ways to store the closure cl(K) 
in a database (see [1, 211]). Essentially, either we may store all the triples 
in table with three columns subject, predicate, object, or we use a table for 
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each predicate, where each table has two columns subject, object. The latter 
approach seems to be better for query answering purposes. Note that making 
all implicit knowledge explicit is viable due to the low complexity of the closure 
computation, which is O(|K|?) in the worst case. 


2.2 The OWL Family 


The Web Ontology Language OWL [338] and its successor OWL 2 [101, 
340] are “object-oriented” languages for defining and instantiating Web on- 
tologies. Ontology (see, e.g., [174]) is a term borrowed from philosophy that 
refers to the science of describing the kinds of entities in the world and how 
they are related. An OWL ontology may include descriptions of classes, prop- 
erties, and their instances, such as 


class Person partial Human 
restriction (hasName someValuesFrom String) 


restriction (hasBirthPlace someValuesFrom Geoplace) 


“The class Person is a subclass of class Human and has two at- 
tributes: hasName having a string as value, and hasBirthPlace 
whose value is an instance of the class Geoplace”. 


Given such an ontology, the OWL formal semantics specifies how to derive its 
logical consequences. For example, if an individual Peter is an instance of the 
class Student, and Student is a subclass of Person, then one can derive that 
Peter is also an instance of Person in a similar way as it happens for RDFS. 
However, OWL is much more expressive than RDFS, as the decision problems 
for OWL are in higher complexity classes [345] than for RDFS. In Fig. 2.2 
we report the various OWL languages and as subscript the DL they relate 
to [19, 125]. 

OWL 2 [101, 340] is an update of OWL 1 adding several new features, 
including an increased expressive power. OWL 2 also defines several OWL 2 
profiles, i.e., OWL 2 language subsets that may better meet certain computa- 
tional complexity requirements or may be easier to implement. The choice of 
which profile to use in practice will depend on the structure of the ontologies 
and the reasoning tasks at hand. The OWL 2 profiles are: 


OWL 2 EL is particularly useful in applications employing ontologies that 
contain very large numbers of properties and/or classes (basic reasoning 
problems can be performed in time that is polynomial with respect to 
the size of the ontology [13, 18]). The EL acronym reflects the profile’s 
basis in the EL family of description logics [13, 18]. 


OWL 2 QL is aimed at applications that use very large volumes of instance 
data, and where query answering is the most important reasoning task. 
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OWL Full 
OWL2 
SROIQ 
A 
OWL DL 
SHOIN 
OWL Lite 
SHIF 
OWL 2 RL OWL 2 EL 
Datalog EL++ 
OWL 2 QL 
DL-Lite 


FIGURE 2.2: OWL family. 


In OWL 2 QL, conjunctive query answering can be implemented us- 
ing conventional relational database systems. Using a suitable reason- 
ing technique, sound and complete conjunctive query answering can 
be performed in LogSpace with respect to the size of the data (asser- 
tions) [12, 87]. The QL acronym reflects the fact that query answering 
in this profile can be implemented by rewriting queries into a standard 
relational query language such as SQL [449]. 


OWL 2 RL is aimed at applications that require scalable reasoning without 


sacrificing too much expressive power. OWL 2 RL reasoning systems 
can be implemented using rule-based reasoning engines, as a mapping 
to Logic Programming [263], specifically Datalog [449], exists. The RL 
acronym reflects the fact that reasoning in this profile can be imple- 
mented using a standard rule language [173]. The computational com- 
plexity is the same as for Datalog [113] (polynomial in the size of the 
data, ExpTime w.r.t. the size of the knowledge base). 


2.3 The RIF Family 


The Rule Interchange Format (RIF) aims at becoming a standard for ex- 


changing rules, such as 


Forall ?Buyer ?Item ?Seller 
buy(?Buyer ?Item ?Seller) :- sell(?Seller ?Item ?Buyer) 
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“Someone buys an item from a seller if the seller sells that item to 
the buyer” 


among rule systems, in particular among Web rule engines. RIF is in fact a 
family of languages, called dialects, among which the most significant are: 


RIF-BLD The Basic Logic Dialect is the main logic-based dialect. Techni- 
cally, this dialect corresponds to Horn logic with various syntactic and 
semantic extensions. The main syntactic extensions include the frame 
syntax and predicates with named arguments. The main semantic ex- 
tensions include datatypes and externally defined predicates. 


RIF-PRD The Production Rule Dialect aims at capturing the main aspects 
of various production rule systems. Production rules, as they are cur- 
rently practiced in mainstream systems like Jess? or JRules?, are defined 
using ad hoc computational mechanisms, which are not based on a logic. 
For this reason, RIF-PRD is not part of the suite of logical RIF dialects 
and stands apart from them. However, significant effort has been ex- 
tended to ensure as much sharing with the other dialects as possible. 
This sharing was the main reason for the development of the RIF Core 
dialect. 


RIF-Core The Core Dialect is a subset of both RIF-BLD and RIF-PRD, thus 
enabling limited rule exchange between logic rule dialects and produc- 
tion rules. RIF-Core corresponds to Horn logic without function symbols 
(î.e., Datalog [2, 449]) with a number of extensions to support features 
such as objects and frames as in F-logic [238]. 


RIF-FLD The Framework for Logic Dialects is not a dialect in its own right, 
but rather a general logical extensibility framework. It was introduced 
in order to drastically lower the amount of effort needed to define and 
verify new logic dialects that extend the capabilities of RIF-BLD. 


2.4 The Query Language SPARQL 


Strictly speaking, SPARQL [381, 380] is a query language for data that 
is stored natively as RDF or viewed as RDF via middleware. From a logical 
point of view, its logical counterpart is the well-known notion of conjunc- 
tive/disjunctive query. As such, we may see SPARQL essentially as a query 
language for databases and, indeed, has much in common with SQL. 

While SPARQL was originally proposed to query RDFS graphs only, in 
the meanwhile, by relying on the representation of OWL and RIF in RDFS, 


2http://www. jessrules.com/ 
Shttp://www.ilog.com/products/jrules/ 
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SPARQL was used to query OWL 2 and RIF ontologies as well, via the def- 
inition of the so-called entailment regimes. In fact, what correct answers to 
a SPARQL query are depends on the used entailment regime [379] and the 
vocabulary from which the resulting answers can be taken. 

As an illustrative example, the following is a SPARQL query 


SELECT ?p ?c 

WHERE { (?p, type, ebayEmp) 
OPTIONAL {(?p, hasCar, ?c) } 
} 


whose answers are Ebay employees that optionally owned a car. Note that the 
basic ingredients of the query are triples. 


Chapter 3 


Resource Description Language RDF 
& RDF Schema 


3.1 Introduction 


The Resource Description Framework (RDF) is a language for representing 
information about resources in the World Wide Web [304, 361] and has become 
a quite popular Semantic Web representation formalism. It is particularly 
intended for representing metadata about Web resources, such as the title 
and author of a Web page. However, by generalizing the concept of a “Web 
resource,” RDF can also be used to represent information about things that 
can be identified on the Web, even when they cannot be directly retrieved on 
the Web. For our purposes we can think of it as anything we can identify. 

RDF is based on the idea of identifying things using identifiers (called 
Uniform Resource Identifiers, or URIs) and describing resources in terms of 
simple properties and property values. RDF properties may be thought of as 
attributes of resources and in this sense correspond to traditional attribute- 
value pairs. This enables RDF to represent a simple statement about resources 
as a triple (s,p,o), such as (umberto, likes, tomato), stating that subject s 
has property p with value o. Triples may also be represented as graphs, where 
nodes represent resources or values and arcs represent properties, as shown in 
Figure 3.1. In RDF Schema (RDES) [77], which is an extension of RDF, addi- 
tionally some special keywords may be used as properties to further improve 
the expressivity of the language. For instance (see Figure 3.1) we may also 
express that the class of “tomatoes are a subclass of the class of vegetables”, 
while Zurich is an instance of the class of cities. 

In what follows, we will describe the essential features of RDFS from a 
syntax, semantics, and a reasoning point of view that are necessary for our 
discussions later in Chapter 9 when we introduce fuzzy RDFS. Our exposition 
is along the line followed by [177, 307, 325] to describe syntax, semantics, and 
inference system for the “core” part of RDES, called pdf [325]. 

We refer the reader to Appendix A for a complete formal definition of the 
semantics of RDFS using the notion of interpretation defined here. 
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FIGURE 3.1: Triples as graph. 


3.2 RDF and RDFS 


Syntax. Consider pairwise disjoint alphabets U, B, and L denoting, respec- 

tively, URI references, blank nodes, and literals. We assume U, B, and L fixed, 

and for ease we will denote unions of these sets simply concatenating their 

names. We call the elements in UBL (B) terms (variables, denoted z, y, 2). 
An RDF triple is a triple 7 of the form 


(s,p,o) € UBL x U x UBL. 


We call s the subject, p the predicate, and o the object. Note that as in [325] 
we allow literals for s. A graph G is a set of triples, a subgraph is a subset of 
a graph, the universe of G, universe(G), is the set of elements in UBL that 
occur in the triples of G, the vocabulary of G, voc(G), is universe(G) NUL. 

For our purposes, we rely on a fragment of RDFS, called pdf [325], that 
covers essential features of RDFS (read rho-df, the p from restricted RDF). 
pdf is defined as the following subset of the RDFS vocabulary: 


pdf = {sp, sc, type, dom, range} , 
where the keywords in pdf may be used in triples as properties. Informally, 


p, Sp, q) means that property p is a sub-property of property q; 
c 


e (c,sc,d) means that class c is a subclass of class d; 


( 
( 
e (a,type,b) means that a is of type b; 
(p, dom, c) means that the domain of property p is c; and 
( 


p, range, c) means that the range of property p is c. 
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Remark 1. In a First-Order Logic (FOL) setting, we may interpret classes 
as unary predicates, and (RDF) predicates as binary predicates. Then 


1. a subclass relation between class c and d may be encoded as the formula 
Vu.c(x) > d(x) 
2. a subproperty relation between property p and q may be encoded as 
VaVy.p(z, y) > q(x, y) 
3. domain and range properties may be represented as: 


VaWy.p(x,y) >c[x) and VaWy.p(z,y) > cly) 


4. the transitivity of a property can be represented as 


VaWyaz.(plx,z) Ap(z,y)) > p(x, y) - 


Although this remark is trivial, we will see that it will play an important role 
in the formalization of fuzzy RDFS. 


In what follows we define a map or (variable assignment) as a function u : 
UBL > UBL preserving URIs and literals, i.e., u(t) = t, for all t € UL. 
Given a graph G, we define u(G) = {(u(s), u(p), u(0)) | (s,p,o) € G}. We 
speak of a map y from Ga to Ga, and write u : Gi > Ga, if u is such that 
p(G1) € Go. 

A grounding G’ of graph G is obtained, as usual, by replacing variables in 
G with terms in UL. A graph G without variables is called ground. 


Example 1. The following is a graph (partially represented in Figure 3.1) 


G ={ (umberto, likes, tomato), (umberto, born, zurich), 
(tomato, type, edibleF ruit), (edibleFruit, sc, fruit), 
(born, dom, person), (born, range, city) } . 


Note that G is ground. 


Semantics. An interpretation T over a vocabulary V is a tuple 
T= (Ar, Ap, Ac, Az, PEI, CL], 7) , 


where Ar, Ap, Ag, Az are the interpretation domains of Z, which are finite 
non-empty sets, and P[:], C[-], % are the interpretation functions of Z. They 
have to satisfy: 


1. Ar are the resources (the domain or universe of T); 
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TABLE 3.1: Semantic conditions for classical pdf interpretations. 
Simple: 1. for each (s,p,o) € G, p? € Ap and (s7,07) € Plp?]; 


Subproperty: 1. P[sp7] is transitive over Ap; 
2. if (p,q) € Plsp?] then p,q € Ap and Plp] € Pla]; 


Subclass: 1. P[sc?] is transitive over Ac; 
2. if (c,d) € P[sc?] then c,d € Ac and C[c] € C[d]; 


Typing I: 1. x € Cle] if and only if (x,c) € Pltype?]; 
2. if (p,c) € P[dom*] and (x,y) € P[p] then x € Cl]; 
3. if (p,c) € Plrange*] and (a, y) € P[p] then y e Cl]; 


1. for each e € pdf, e? € Ap 

2. if (p,c) € P[dom?] then p € Ap and c € Ac 
3. if (p,c) € Plrange*] then p € Ap and c € Ac 
4. if (x,c) € Pltype*] then c € Ac 


Typing II: 


2. Ap are property names (not necessarily disjoint from Ag); 
. Ac C Ap are the classes; 

. Ap C Ap are the literal values and contains LN V; 

. PĮ] is a function P|]: Ap > 242*48; 


„C[] is a function C[]: Ac > 248; 


N DO a A W 


. -Z maps each t € ULN V into a value t? € Ap U Ap, and such that -7 
is the identity for plain literals and assigns an element in Ar to each 
element in L. 


An interpretation Z is a model of a ground graph G, denoted T E G, if 
and only if Z is an interpretation over the vocabulary pdf U universe(G) that 
satisfies the conditions in Table 3.1. 

Entailment, denoted G + H, among ground graphs G and H is as usual: 
a ground graph G entails a ground graph H if and only if any model of G is 
also a model of H. The generalization to non-necessarily ground graphs is as 
follows: a graph G entails a graph H, denoted G |= H, if and only if for any 
grounding G” of G there is a grounding H’ of H such that G’ = H’. 


Remark 2 (Reflexivity issue). In [325], the authors define two vari- 
ants of the semantics: the default one includes reflexivity of Plsp*] 
(resp. C[sc*]) over Ap (resp. Ac). Here we are only considering the 
alternative semantics presented in [325, Definition 4] which omits this 
requirement. Thus, we do not support entailment of triples such as 
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range), that depend on the reflexivity and are of marginal interest anyway. 
See [325] (cf. p. 59) for a more in depth discussion on this issue. 


Let us denote with E™ the entailment relation |= of pdf in which we include 
also the reflexivity of P[sp7] and C[sc7]. Then it can be shown that 


Proposition 1 ([325]). Let Œ and H be pdf graphs. Assume that H does not 
contain triples of the form (a,sp,a) nor (a,sc,a) for a € UL, nor triples of 
the form (x,sp, y) nor (x,sc, y) for x € B or y € B. Then, 


GE H if and only if GEA. 


Essentially the above theorem states that the only use of reflexive restrictions 
in RDFS models is the entailment of triples of the form (a, sp, b), (a, sc, b), or 
their existential versions replacing the subject or object by variables (blank 
nodes). 

The next proposition shows that }='™ retains the original semantics of 
RDFS. Let Erprs be the RDFS entailment relation defined in [307, 362] (see 
Appendix A). 


Proposition 2 ([325]). Let G and H be graphs that do not mention RDFS 
vocabulary outside pdf. Then 


G rors H if and only if GE“ H. 


Combining Propositions 1 and 2 we may characterize our entailment relation 
= in pdf with respect to full RDFS semantics as follows. 


Corollary 1. Let G and H be graphs that do not mention RDFS vocabulary 
outside pdf. Assume that H does not contain triples of the form (a,sp,a) nor 
(a,sc, a) fora € UL, nor triples of the form (x,sp, y) nor (a,sc,y) for x € B 
ory€B. Then, 


G Erprs H if and only if GHH. 


Remark 3 (Axiomatic triples). Note that in pdf models we do not impose the 
a priori satisfaction of any axiomatic triple. Indeed, pdf models do not satisfy 
any of the RDFS axiomatic triples in [307, 362] (see Appendix A, Table A.2), 
because all of them mention RDFS vocabulary outside pdf. This is also the rea- 
son for the inclusion of the “Typing II” conditions in pdf models that capture 
the semantics restrictions imposed syntactically by the RDFS axiomatic triples 
(dom, dom, prop), (dom, range, class), (range, dom, prop), (range, range, class), and 
(type, range, class), and the fact that every element in pdf must be interpreted 
as a property. 


Another property of |= is that it does not entail axiomatic triples: 


Proposition 3 ([325]). There is no pdf triple T such that E 7. 


22 Foundations of Fuzzy Logic and Semantic Web Languages 


As we will see, Corollary 1 and Proposition 3 simplify the reasoning machinery 
for pdf significantly. 
We conclude this section with an example. 


Example 2. Consider the graph G in Example 1. Then the following entail- 
ment relations hold: 


G FE (tomato,type, fruit) 
G | (umberto,type, person) 
G FE (zurich,type, city) . 


For instance, concerning G = (tomato, type, fruit), informally the argument 
is as follows. A tomato is an edible fruit, edible fruits are fruits and, thus, a 
tomato is a fruit. Concerning G E (zurich, type, city), we have that umberto 
is born in Zurich, if someone is born somewhere then this latter is a city and, 
thus, Zurich is a city. The case G E (umberto, type, person) is similar. 


3.3  Conjunctive Queries 


Concerning query answering, we are inspired by [177] and the logic pro- 
gramming setting [263] and we assume that a RDFS graph G is ground. That 
is, blank nodes have been skolemized, i.e., replaced with terms in UL. 

A conjunctive query is of the rule-like form 


a(x) — dy.p(x, y) (3.1) 


where q(x) is the head and Jy.p(x, y) is the body of the query, which is a 
conjunction (we use the symbol “,” to denote conjunction in the rule body) 
of triples 7; (1 < i < n). x is a vector of variables occurring in the body, 
called the distinguished variables, y are so-called non-distinguished variables 
and are distinct from the variables in x, each variable occurring in 7; is either 
a distinguished or a non-distinguished variable. If clear from the context, we 
may omit the existential quantification dy. 
A query example is: 


q(x, y) — (y, created, x), (y, type, Italian), (x, exhibited At, Uf fizi) 


having intended meaning to retrieve all the artifacts x created by Italian artists 
y, being exhibited at Uffizi Gallery. 

In order to define an answer to a conjunctive query we introduce the fol- 
lowing definitions. Given a vector x = (11,...,Ux) of variables, a substitution 
over x is a vector of terms t replacing variables in x with terms of UBL. 
Then, given a query g(x) + dy.y(x,y), and two substitutions t,t’ over x 
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and y, respectively, the query instantiation y(t, t’) is derived from y(x, y) by 
replacing x and y with t and t’, respectively. 

Note that a query instantiation p(t,t') is an RDF graph. 

Given a ground graph G, a query q(x) — Jy.p(x, y), and a vector t of 
terms in universe(G), we say that q(t) is entailed by G, denoted G E q(t), if 
and only if there is a vector t’ of terms in universe(G) such that in any model 
T of G, Tis a model of the query instantiation y(t, t’). 

If G = q(t) then t is called an answer to q. The answer set of q w.r.t. G 
is defined as ans(G, q) = {t | GE q(t)}. 


3.4 Reasoning 


In what follows, we provide a sound and complete deductive system (for the 
graph entailment problem) for our language derived from [325]. The complete 
RDFS inference rules are presented in Appendix A. 

The system is arranged in groups of rules that capture the semantic con- 
ditions of models. In every rule, A, B,C, X, and Y are meta-variables repre- 
senting elements in UBL and D, E represent elements in UL. The rules are 
described in Table 3.2. 


Remark 4 (On rules (5a) and (5b)). As noted in [307, 440], the set of rules 
presented in [362] is not complete for RDFS entailment. The problem is pro- 
duced when a blank node X is implicitly used as standing for a property in 
triples like (a,sp, X),(X,dom, b), or (X, range, c). Here we solve the problem 
following the elegant solution proposed by [307] adding just two new rules of 
implicit typing (rules 5 above). 


Remark 5 (Rules for reflexivity). A reader familiar with [325] will notice 
that these rules are as rules 1-5 of [325] (which has 7 rules). We excluded the 
rules handling reflexivity (rules 6-7) that are not needed in our setting. 

Furthermore, as noted in [325], the “Implicit Typing” rules are a neces- 
sary addition to the rules presented in [362] for complete RDFS entailment. 
These represent the case when variable A in (D,sp, A) and (A,dom, B) or 
(A, range, B), is a property implicitly represented by a blank node. 

For completeness, we recap the missing rules 6-7 of [325] below: 


6. Subproperty Reflexivity: 
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TABLE 3.2: Inference rules for pdf. 


1. Simple: 
(a) EFforamapp:G'>G 
(b) S for G' CG 


2. Subproperty: 


(A,sp,B),(B,sp,C) 
(a) aa 


(b) (D,sp,Z),(X,D,Y) 


(X,E,Y) 
3. Subclass: 
(A,sc, B),(B,sc,C) 
(a) (A,sc,C) 


(A,sc,B),(X,type, A) 
(b) (X type B) 


4. Typing: 
(D,dom,B),(X,D,Y) 
(a) pa 
(D,range,B),(X,D,Y) 
Da MI 
5. Implicit Typing: 


(A,dom, B),(D sp, A), (X,D,Y) 
(a) (X,type, B) 


(A,range, B),(D,sp,A),(X,D,Y) 
(b) = O ype B) 


(X,A,Y) 
a) (ma) 


(Asp, B) 
b) 1421 (895) 


c) umm frp E pdf 


Ap, 
d) ees) for p € {dom, range} 


7. Subclass Reflexivity: 


A,sc,B 
(a) set 


(A,sc,B) 
(b) (B,sc,B) 


(c) ee for p € {dom, range, type} 


An instantiation of a rule is a uniform replacement of the metavariables oc- 
curring in the triples of the rule by elements of UBL, such that all the triples 
obtained after the replacement are well-formed RDF triples. 
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A proof is defined in the usual way. Let G and H be graphs. Then G H H 
iff there is a sequence of graphs P,,..., Pp with Pi = G and Pk = H, and for 
each j (2 < j < k) one of the following holds: 


1. there exists a map u : P; > P;-1 (rule (1a)); 
2: Pj € Pa (rule (1b)); 


3. there is an instantiation = of one of the rules (2)(5), such that R C Pj-a 
and Pj = Pj-1 U R. 
The sequence of rules used at each step (plus its instantiation or map), is 
called a proof of H from G. 


Example 3. Consider Example 1. The following is a proof of G + 
(tomato, type, fruit): 


(1) (tomato, type, edibleFruit) Rule (1b) 

(2) (edibleFruit,sc, fruit) Rule (1b) 

(3) (tomato, type, fruit) Rule (3b) applied to (1) and (2) . 
The following proposition shows that our proof mechanism is sound and com- 
plete w.r.t. the pdf semantics: 


Proposition 4 (Soundness and Completeness [325]). Inference + based on 
rules 1-5 is sound and complete for =, that is, 


GLH if and only if GHH. 


Proposition 5 ([325]). Assume Gt G” then there is a proof of G’ from G 
where the rule (1a) is used at most once and at the end. 


Corollary 2 ([325]). Define the proof system F"* as for pdf by adding also 
the rules of reflexivity (rules (6) and (7)). Then for graphs G and H, 


G =” H if and only if GP" H. 


Corollary 3 ([325]). Assume G +" H then there is a proof of H from G 
where the rule (1a) is used at most once and at the end. 


For notational convenience, we denote with {7,...,7} Heroes T that the 
consequence 7 is obtained from the premise 7),...,7, by applying one of the 
inference rules with the exclusion of rules (la) and (15). 

The closure of a graph G is defined as cl(G) = {r | GF* 7}, where P* is 
as F except that rule (La) is excluded. Note that the size of the closure of G 
is polynomial in the size of G and that the closure is unique. 


Example 4. Consider the graph G in Example 1. Then the closure of G is: 
dG) =GU{_ (tomato, type, fruit), 
(umberto, type, person), 


(zurich, type, city) }. 
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Using the closure, we may immediately prove that 
Proposition 6. Any graph G has a model. 
More importantly, from a practical point of view, it can be proven that: 


Proposition 7 ([325]). Gt H if and only if H C cl(G) or H is obtained 
from cl(G) by applying rule (la). 


Finally, note that a simple method to determine G |= 7, where both G and 
T are ground, consists in computing cl(G), where the size of the closure of G 
is O(|G|?), and check whether 7 is included in cl(G) [325]. [325] provides also 
an alternative method to test G E 7 that runs in time O(|G| log |G|). 

There also several ways to store the closure cl(G) in a database (see [1, 
211]). Essentially, either we may store all the triples in a table with three 
columns subject, predicate, object, or we use a table for each predicate, where 
each table has two columns subject, object. The latter approach seems to be 
better for query answering purposes. 


Example 5. Consider Example 2. Then the one table variant to store cl(G) 


is: 
closure 

subject predicate | object 
umberto likes tomato 
umberto born zurich 
tomato type edibleFruit 
edibleFruit | sc fruit 
born dom person 
born range city 
tomato type fruit 
umberto type person 
zurich type city 


On the other hand, in case we use one table for each predicate we have the 
tables: 


likes born type 
subject | object subject | object subject | object 
umberto | tomato umberto | zurich tomato | edibleFruit 


tomato | fruit 
umberto | person 
zurich | city 


sc dom range 
edibleFruit | fruit born | person born | city 


We have also the following complexity result: 
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Proposition 8 ([440]). Entailment for RDFS is decidable, NP-complete, and 
in P if the target graph does not contain blank nodes. 


We conclude the chapter by showing how to compute the answer set of a 
conjunctive query (see Section 3.3). The following can be shown: 


Proposition 9. Given a ground graph G, t is an answer to q if and only if 
there exists an instantiation y(t, t’) that is true in the closure of G (i.e., all 
triples in p(t, t’) are in cl(G)). 


Therefore, we have a simple method to determine ans(G, q). Namely, compute 
the closure cl(G) of G and store it into a database, e.g., using the method [210]. 
It is easily verified that any query can be mapped into an SQL query over the 
underlying database schema. Hence, ans(G, q) can be determined by issuing 
such an SQL query to the database. 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


Chapter 4 


Web Ontology Language OWL 


4.1 Introduction 


The Web Ontology Language OWL [338] and its successor OWL 2 [101, 
340] are languages for defining and instantiating Web ontologies. Ontology 
(see, e.g., [174]) is a term borrowed from philosophy that refers to the science 
of describing the kinds of entities in the world and how they are related. 

An OWL ontology may include descriptions of classes, properties, and their 
instances. Given such an ontology, the OWL formal semantics specifies how to 
derive its logical consequences, i.e., facts not literally present in the ontology, 
but entailed by the semantics. An OWL ontology is a formal conceptualization 
of a domain of interest and consists of the following three different syntactic 
categories: 


e Entities, such as classes, properties, and individuals, are identified by 
URIs and can be thought of as primitive terms or names. Entities repre- 
sent basic elements of the domain being modelled. For example, a class 
Person can be used to model the set of all people. Similarly, the object 
property parentOf can be used to model the parent-child relationship. 
Finally, the individual Peter can be used to represent a particular person 
called “Peter”. 


e Expressions represent complex notions in the domain being modelled. 
For example, a class expression describes a set of individuals in terms 
of the restrictions on the individuals’ features. 


e Axioms are statements that are asserted to be true in the domain being 
modelled. For example, using a subclass axiom, one can state that the 
class Student is a subclass of the class Person. 


These three syntactic categories are used to express the logical part of OWL 
ontologies — that is, they are interpreted under a precisely defined semantics 
that allows useful inferences to be drawn. For example, if an individual Peter is 
an instance of the class Student, and Student is a subclass of Person, then from 
the OWL semantics one can derive that Peter is also an instance of Person in a 
similar way as it happens for RDFS. However, OWL is much more expressive 
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than RDFS, as we will see later on, as the decision problems for OWL are in 
higher complexity classes [345] than for RDFS. 

We recall that the OWL language provides three increasingly expressive 
sublanguages designed for use by specific communities of implementers and 
users. 


e OWL Lite supports those users primarily needing a classification hi- 
erarchy and simple constraint features. For example, while OWL Lite 
supports cardinality constraints, it only permits cardinality values of 0 
or 1. It should be simpler to provide tool support for OWL Lite than 
its more expressive relatives, and provide a quick migration path for 
thesauri and other taxonomies. Deciding entailment in OWL Lite is 
ExpTime-complete. 


e OWL DL supports those users who want the maximum expressiveness 
without losing computational completeness (all entailments are guaran- 
teed to be computed) and decidability (all computations will finish in 
finite time) of reasoning systems. More precisely, deciding entailment in 
OWL DL is NExpTime-complete. OWL DL includes all OWL language 
constructs with restrictions such as type separation (a class cannot also 
be an individual or property, a property cannot also be an individual or 
class). OWL DL is so named due to its correspondence with Description 
Logics (DLs) [19, 125], a family of FOL fragments. 


e OWL Full is meant for users who want maximum expressiveness and the 
syntactic freedom of RDF with no computational guarantees. For exam- 
ple, in OWL Full a class can be treated simultaneously as a collection 
of individuals and as an individual in its own right. Deciding entailment 
in OWL full is undecidable. 


Each of these sublanguages is an extension of its simpler predecessor, both 
in what can be legally expressed and in what can be validly concluded. The 
following set of relations hold. Their inverses do not. 


1. Every OWL Lite ontology is an OWL DL ontology. 
2. Every OWL DL ontology is an OWL Full ontology. 
3. Every OWL Lite conclusion is an OWL DL conclusion. 
4. Every OWL DL conclusion is an OWL Full conclusion. 


OWL 2 [101, 340] is an update of OWL adding several new features, including 
an increased expressive power—mainly w.r.t. properties, extended support for 
datatypes, simple meta modelling capabilities, extended annotation capabili- 
ties, database style keys. OWL 2 also defines several OWL 2 profiles, i.e., OWL 
2 language subsets that may better meet certain computational complexity 
requirements or may be easier to implement. The choice of which profile to use 
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in practice will depend on the structure of the ontologies and the reasoning 
tasks at hand. The OWL 2 profiles are: 


e OWL 2 EL is particularly useful in applications employing ontologies 
that contain very large numbers of properties and/or classes. This pro- 
file captures the expressive power used by many such ontologies and 
is a subset of OWL 2 for which the basic reasoning problems can be 
performed in time that is polynomial with respect to the size of the 
ontology [13, 178]. Dedicated reasoning algorithms for this profile are 
available and have been demonstrated to be implementable in a highly 
scalable way. The € £ acronym reflects the profile’s basis in the E£ family 
of description logics [13], logics that provide only existential quantifica- 
tion (see later on). 


e OWL 2 QLis aimed at applications that use very large volumes of in- 
stance data, and where query answering is the most important reasoning 
task. In OWL 2 QL, conjunctive query answering can be implemented 
using conventional relational database systems. Using a suitable rea- 
soning technique, sound and complete conjunctive query answering can 
be performed in LogSpace with respect to the size of the data (asser- 
tions) [12, 87]. Asin OWL 2 EL, polynomial time algorithms can be used 
to implement the ontology consistency and class expression subsumption 
reasoning problems. The expressive power of the profile is necessarily 
quite limited, although it does include most of the main features of con- 
ceptual models such as UML class diagrams and ER diagrams. The QL 
acronym reflects the fact that query answering in this profile can be im- 
plemented by rewriting queries into a standard relational query language 
such as SQL [449]. 


e OWL 2 RLis aimed at applications that require scalable reasoning with- 
out sacrificing too much expressive power. It is designed to accommo- 
date OWL 2 applications that can trade the full expressivity of the lan- 
guage for efficiency, as well as RDFS applications that need some added 
expressivity. OWL 2 RL reasoning systems can be implemented using 
rule-based reasoning engines, as a mapping to Logic Programming [263], 
specifically Datalog [449], exists. The ontology consistency, class ex- 
pression satisfiability, class expression subsumption, instance checking, 
and conjunctive query answering problems can be solved in time that is 
polynomial with respect to the size of the set of facts. The RL acronym 
reflects the fact that reasoning in this profile can be implemented using 
a standard rule language. The design of OWL 2 RL was inspired by the 
Horn-DL family, also called Description Logic Programs (DLPs) in [173] 
and pD* [440]. 
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4.2 Description Logics Basics 


Description Logics (DLs) [19, 125] play a fundamental role for the OWL 
family of languages as they are the theoretical /logical counterpart of them. In 
fact, as OWL languages can be mapped into DLs, algorithms and computa- 
tional complexity results found for DLs are then applied to the OWL family. 
For the sake of clarifying the concept, in Figure 2.2 we resume the OWL family 
of languages, the related computational complexity of reasoning within them! 
and the DL that they refer to: specifically, 


e OWL 2 refers to the DL SROZO [101, 203] 

e OWL 1 DL refers to the DL SHOZN (201, 204, 205, 207, 208] 

e OWL 1 Lite refers to the DL SHIF [204, 205] 

e OWL 2 QL refers to the DL-Lite family, specifically DL-Liteg [12, 87] 
e OWL 2 EL refers to the DL EL family, specifically ELY [13, 18, 178]. 
e OWL 2 RL refers to the Horn-DL family [173, 440). 


We recall that each DL is usually identified with a string of calligraphic letters, 
each of which indicates that the basic DL, AL has been extended with a 
specific constructor [19]. 

DLs are a logical reconstruction of the so-called frame-based knowledge 
representation languages, with the aim of providing a simple well-established 
Tarski-style declarative semantics to capture the meaning of the most pop- 
ular features of structured representation of knowledge. The basic syntactic 
building blocks are atomic concepts (unary predicates), atomic roles (binary 
predicates), and individuals (constants). The expressive power of the language 
is restricted in that it uses a rather small set of constructors for building com- 
plex concepts and roles. Specifically, a DL assumes three alphabets of sym- 
bols, for concepts, roles, and individuals. A concept may be understood as a 
unary predicate (a class), while a role may be seen as a binary predicate (prop- 
erty /attribute of a class). In each DL, complex concepts and roles can be built 
using different concept, and role constructors and classes may be organized 
into a hierarchy. 


4.2.1 The Basic Description Language A£ 


Elementary descriptions are atomic concepts, also called concept names 
(denoted A) and atomic roles (denoted R). Complex concepts (denoted C) 
can be built from them inductively with concept constructors. 


1See http: //www.cs.man.ac.uk/~ezolin/dl/ 
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The basic DL is called AL (Attributive Language) [371]. Concepts in AL 
are formed according to the following syntax rule: 


CID > A | (atomic concept) 
AN | (universal concept) 
zl | (bottom concept) 
=A | (atomic negation) 
COD | (concept conjunction) 
VR.C | (universal restriction) 
3R.T (unqualified existential restriction) 


Note that, in AL, negation can only be applied to atomic concepts, and only 
the top concept is allowed in the scope of an existential quantification over a 
role. Also, the sublanguage of AL obtained by disallowing atomic negation is 
called FL (Frame Description Language) and the sublanguage of FL obtained 
by disallowing limited existential quantification is called FLo [76]. 

The table below provides an informal First-Order Logic (FOL) reading of 
concepts and relative concept expression examples. 


Syntax FOL translation Example 
CID > A | | A(x) Person 
T Te) 
L [fie 
=A | | -4(x) —Femal 
CND || C(2)A D(z) Person M Femal 
VR.C | | wy.R(z,y) > C(x) | Person N VhasChild.Femal 
JR. Jy.R(x, y) Person [1 ShasChild. T 


For instance, Person M VhasChild.Femal will denote those persons all of whose 
children are female. Using the bottom concept, we can also describe those 
persons without a child by the concept Person M VhasChild. L. 

We will often use the abbreviation 3R in place of the unqualified existential 
restriction 4R.T. 

A TBoz T, or Ontology, consists of a finite set of General Concept Inclusion 
axioms (GCIs) C E D, where C and D are concepts (read it as “all instances 
of C are instances of D”). The FOL view of a GCI C E D is informally 
Vx.C(x) + D(a). For ease, we use C = D € T in place of CED,DECET. 
Given an atomic concept A and concept C, we call a GCI of the form A E C 
primitive and call a GCI of the form A = C definitional. An example of GCI 
is Male E —Female. 

An ABoz A consists of a finite set of concept and role assertion axioms a:C 
and (a, b):R, respectively, where a and b are individuals. Examples of assertion 
axioms are tim:Person (“tim is a person”) and (tim, pat):hasChild. (“tim has 
pat as child”). 

A knowledge base K = (T, A) consists of a TBox 7 and an ABox A. For 
ease, sometimes we write K as well as K = T UA. 
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Example 6. A simple AL KB is 


K ={_  tim:Person, (tim, pat):hasChild, pat:Male, 
Male = —Female, Person E VhasChild.Person, 


MalePerson = Person M Male, FemalPerson = Person NM Femal | . 


From a semantics point of view, an interpretation T is a pair T = (A7,-7) 

consisting of a non-empty set AZ (called the domain) and of an interpretation 

function 2 that assigns to each atomic concept a subset of AZ, to each role 

a subset of A? x AZ and to each individual a an element in A? such that 

al 4 b? if a 4 b (this assumption is called Unique Name Assumption - UNA). 
The mapping -7 is extended to concepts as follows: 


TI — AZ 
T 
I = M4 
Y = CIAD 
nA = AP AP 
O SeA RAG) Cer} 
(ARTY = {xe A7 ]|RT(x) 20) 


where R? (x) = (y: (x,y) € RT). 

The satisfiability of an axiom E in an interpretation Z = (A7, 2), denoted 
T KE E, is defined as follows: Z = CE D iff C? C DI, TE aC iff a? e CF, 
and T E (a,b):R iff (af, b?) e RZ. 

For a finite set of axioms E, we say that Z satisfies € iff T satisfies each 
element in €. If T |= E (resp. Z E €) we say that Z is a model of E (resp. 
€). T satisfies (is a model of) a knowledge base K = (T, A), denoted Z E K, 
iff Z is a model of each component T and A, respectively. An axiom F is a 
logical consequence of a knowledge base K, denoted K |= E, iff every model of 
K satisfies E. Determining whether K [5 a:C is called the instance checking 
problem, while determining whether K + C E D is called the subsumption 
problem. 

Given K, we say that two concepts C and D are equivalent, denoted C =x 
D iff in any model Z of K, C? = D7. We say that C is coherent iff there is 
a model Z of K such that C7 4 (). If the knowledge base is empty, we simply 
omit K as subscript. 


Example 7. Consider the KB in Example 6. It can be verified that the KB 
is satisfiable and that 


K 
K |  pat:FemalPerson . 


MalePerson E —FemalPerson 


Informally, e.g., K E pat:FemalPerson holds because Pat is not male and, thus, 
is female. Tim is a person, a person has a person as child, and, thus, Pat, 
which is a child of Tim, is a person, too. Therefore, Pat is a female person. 
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4.2.2 The DL Family 

We next provide an overview of the major logics belonging to the DL 
family. 
4.2.2.1 DLs Naming Convention 


DLs are a family of logics that can be identified by the constructs they 
use. Typically, most constructs have a letter to identify them and their use 
will be denoted by adding the letter to the basic description language AL. For 
instance, the DL ALC, is obtained from AL by adding the construct identified 
by C (concept complement) to AL. Below, a list of major DL constructors, 
where C, D are concepts and Z is an interpretation. 


C: Concept negation, denoted —C, and semantics 
(aC)? = AF \ Ct. 
Note that =C = C. 
U: Concept disjunction, denoted C U D, and semantics 
(CUDY =CtUD*. 


Note that CU D = —(-0N—D). Also, a GCI C E D is the same as 
TECUD. 


€: Qualified existential restriction, 3R.C, and semantics 


(AR.C)* = {x e AT | RZ (x) N CT Ap). 


Note that 3R.C = ~YR.C. 


O: Nominals (singleton class), denoted {a}, and semantics 


(a? = {a7}. 


Note that a:C can be expressed as {a} E C, while (a,b):R can be ex- 
pressed as {a} E ARID). 


N: Number restrictions, denoted (> n R) and (< n R), and semantics 


(>n RY = (e A™| ER (z) > n} 
(<n R} = {xe AT|fR (2) <n}, 


where #5 is the cardinality of a set S. Note that (< n R) =7(> n+1 R). 
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O: Qualified number restrictions, denoted (> n R.C) and (< n R.C), and 
semantics 

(>n RO = (eA [HER (2)nc0?)>n) 

(<n R.CY (a e AT | ţ(R7(2) NC?) <n} 


| 


Note that (<n R.C) ==(2 n+1 R.C). 


T: Inverse role, denote R~, and semantics 


(R-Y = {(y,2) | (x,y) € RT). 


Note that (R~)~ is the same as R. Sometimes we write R~ with the 
intended meaning that RT =P” if R= P, and R = P, if R=P-. 


F: Local functional role, denoted (< 1 R), and semantics 
(<1 R = {z€ A7 | HR (x) < 1). 


F,: Global functional role axiom, denoted fun(S), and semantics 


T E fun(S) iff for all « € AZ. #97 (a) <1. 


H: Role inclusion axiom, denoted Rı E Ra, and semantics 


T | Ri C Ro if Rif C R2. 


R: Complex role inclusion axiom, denoted Ro S E R and Ro S CE S, and 
semantics 
TEFROSER if RoS? CR 
THRoSES iff Riostcs, 


Note that Ro SE R(RoS E S) is also denoted as RS E R (RS E S). 


R4: Transitive role axiom, denoted trans( R), and semantics 


trans(R)% iff R7 transitive . 


Note that trans(R) can be expressed as Ro RE R. 
S: Used for ALC with transitive roles, i.e., the DL ACCR.. 
We will also use the following shorthands: 


e Ci > Ca for =C1 LI Ca; 
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Cı > Ca for (Ci => C2) M (Ca y Ci); 


dom(R, C), called domain restriction axiom, for 3R.T EC; 


e ran(R,C), called range restriction axiom, for T E VR.O; 


e disj(C, D), called disjointness axiom, for CN D EL; 


(=n R.C) for (> n R.C) N (<n R.C); 
e (=n R) for (=n R.T). 


4.2.2.2 Concrete Domains 


Concrete domains [21, 294, 295] are used to extend DL languages to deal 
with datatypes, such as strings and integers. The elementary ingredients are 
as follows. We assume a set of data values, a set of elementary datatypes, and a 
set of datatype predicates, where each datatype predicate has a predefined ar- 
ity n> 1. A datatype is an elementary datatype or a finite set of data values. A 
datatype theory D= (AP, -?) consists of a datatype domain AP and a map- 
ping -P that assigns to each data value an element of AP, to each elementary 
datatype a subset of AP, and to each datatype predicate p of arity n a relation 
over AP of arity n. We extend -P to all datatypes by {v,,...}P ={vP,...}. 
For example, over the integers, >29 may be a unary predicate denoting the set 
of integers greater or equal to 20. Concerning roles, a role R is either an object 
property or a datatype property. An interpretation maps an object property 
into a subset of A? x AZ, while maps a datatype property into a subset of 
AZ x AP. A datatype property does not have an inverse. For instance, 


Person N Jage.>20 (4.1) 


is a concept denoting the set of people whose age is at least 20. 

We also use an alphabet for concrete individuals, denoted v, and extend 
an interpretation to concrete individuals by mapping them into AP. As for 
individuals, we adopt the UNA, i.e., v1? Æ vo? if vı £ vo. 


Example 8. Consider K with 


AdultPerson = Person N Jage.>20 
fun(age) 

tom:Person 

(tom, 34) :age 


Then 
K |= tom:AdultPerson . 


Informally, Tom is a person whose age is 34. Since an adult people is identified 
as a person having an age no less than 20, Tom is an adult person, too. 
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To indicate that a DL language £ has been extended with concrete domains, 
the convention is to append the label (D) to £. For instance, ALC(D) denotes 
the DL language ALC extended with the concrete domain D. 

Eventually, for an n-ary concrete predicate p, for functional datatype prop- 
erties f1,... fn, we will use the expression 


PCi În) 


as concept expression with semantics 


T = D, 
Pfi- fn) = {x € AT | dy, e AP, fil(x) = yi, (M1,---, Yn) E P KEE 
For instance, the concept expression 4.1 can also be written as 


Person N >20(age) . 


4.2.2.3 The AL Family and SROTQ(D) 


In the AL family we have essentially all languages obtained from AL by 
adding the construct identified by some letter described above. Prominent 
representatives are 


ALC. The DL ALC [371] is the reference DL language and is typically used 
whenever new ideas and extensions are explored. ALC is also closely 
related to Hybrid Logics [11], which are roughly modal logics allowing to 
talk about the worlds. This is not surprising due to the close relationship 
between ALC and Propositional Multimodal Logic K [370] (intuitively, 
VR.C > ORC and 1R.C > ORC and a:C rh @,C and TECH OC, 
where U is an universal accessibility relation?). From a computational 
complexity point of view, e.g., the knowledge base satisfiability problem 
is ExpTime-complete [131]. 


SHIF. The importance of the DL SHZF is due to the fact that it is the 
logical counterpart of the OWL 1 Lite [204, 205]. With that we mean 
that OWL 1 Lite constructs can be mapped into the DL SHZF(D). 
Note that in number restrictions, only so-called simple roles (i.e., which 
are neither transitive nor have a transitive subrole) are allowed. From a 
computational complexity point of view, e.g., the knowledge base satis- 
fiability problem is ExpTime-complete [193, 442]. 


SHOIN. The importance of the DL SHOZN is due to the fact that it is 
the logical counterpart of the OWL 1 DL [201, 204, 205, 207, 208], 
i.e., OWL 1 DL constructs can be mapped into the DL SHOIN (D). 
As for SHZF, in number restrictions, only simple roles are allowed. 
From a computational complexity point of view, e.g., the knowledge 
base satisfiability problem is NExpTime-complete [171, 441, 442]. 


2For all worlds w, w’, U(w, w’) holds. 
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SROTQ. The importance of the DL SROZO is due to the fact that it is the 
logical counterpart of the OWL 2 [101, 203], i.e., OWL 2 constructs can 
be mapped into the DL SROTQ(D). In number restrictions, only simple 
roles are allowed, the set of complex role inclusions (and ordinary role 
inclusions) is supposed to be acyclic (in a non-standard way), and there 
are some other features, such as disjoint roles, reflexive and irreflexive 
roles, asymmetric roles, etc. From a computational complexity point 
of view, e.g., the knowledge base satisfiability problem is 2NExpTime- 
complete [228]. 


For the sake of completeness, we recap here SROZO(D) [203]. The constructs 
are illustrated in Table 4.1, where we use the following notation: 


e C,D are concepts 


A is an atomic concept 
e Risan object property 


e S is a simple object property (also, called simple role, defined below) 


T is a datatype property 
e a,b are individuals, v is a concrete individual 
e disa concrete predicate 


e misa natural number 


Note that trans(R) is the same as RR E R, and sym(R) is the same as RT ER. 

The notion of simple role is defined as follows. We start with the definition 
of a (regular) role hierarchy whose definition involves a certain ordering on 
object properties, called regular. A strict partial order < on a set A is an 
irreflexive and transitive relation on 4. Á strict partial order on the set of an 
object properties R and their inverse R” is called a regular order if satisfies, 
additionally, S < R iff S~ < R”, for all object properties R and S. Note, 
in particular, that the irreflexivity ensures that neither S < S~ nor ST = S 
hold. Now, let < be a regular order. A role inclusion axiom (RIA for short) is 
an expression of the form w E R, where w is a finite string of object properties 
not including the universal role U, and R Æ U is an object property. A RIA 
w E R, is <-regular if 


1. w= RR, or 

2. w= R, or 

3. w = S1... Sn and S; < R, for all i, or 
4. w = RS1... Sn and S; < R, for all i, or 
5. w = S1... Sn Rand S; < R, for all i. 
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TABLE 4.1: Syntax and semantics of the DL SROTQ(D). 


Concepts Syntax (C) FOL Reading of C(x) 


(C1) A A(x) 
(C2) T 1 
(C3) JE 0 
(C4) cnp C(x) A D(x) 
(C5) CuD C(x) V D(x) 
(C6) =C aC (a) 
(C7) VR.C Vy. R(x, y) > C (y) 
(C8) 3R.C 3y.R(z,y) A Cy) 
(C9) vT.d Vv.T(2,v) > d(v) 
(C10) 3T.d 3v.T(z,v) A d(v) 
(C11) {a} z=a im 
(C12) (> m S.C) Jyı dum. Ni (S(x, yi) A C(yi)) A Ni<jck<m Yi F Yk 
(C13) (< m S.C) Vyr ++ VWym+1- Niza (S(z ya) A Ci) > Vi<j<r<m Yi = Yk 
(C14) (> m T.d) Ivi .. Dum. APL (T (a, vi) A d(vi)) A Ni<j<k<m vj É Uk 
(C15) (< m T.d) Vu1-..Vom41- Aj (T(a, vi) Ad(vi)) > Vi<j<k<m ti = Uk 
(C16) 35.Self S(x, x) 
Roles Syntax (R) Semantics of R(x, y) 
(RI) R R(z, y) 
(R2) R- R(y, x) 
(R3) U 1 
Axiom Syntax (E) Semantics (Z satisfies E if...) 
(A1) a:C C(a) 
(A2) (a, b):R R(a, b) 
(A3) (a, db): R —R(a,b) 
(A4) (a, v):T T(a, v) 
(A5) (a, v):T aT (a,v 
(A6) CED Va.C(x) + D(x) 
(A7) Ri ... Rn CR Vx1Vin+13x%0 . 
Awa (Ra (zu, 22) AN... A Rn EN bn+1)) > R(t1,%n+1) 
(A8) TET VxYv.Tı (x, v) > Tala, v) 
(A9) trans(R) VaVyVz.R(a,z) A R(z, y) > R(z, y) 
(A10) disj( S1, S2 VaVy.Si (x,y) A S2(£, y) =0 
(A11) disj(T1, T2) vavv.Ti (x, v) AT2(2,v) =0 
(A12) ref(R) Vi.R(x, x) 
(A13) irr(S) Va.—S(z,z) 
(A14) sym(R) VaWy.R(x, y) = R(y, 2) 
(A15) asy(S) VaWy, S(x, y) > 25(y, x) 


An RBoz, denoted R, consists of a finite set of axioms of the form (47) — (A15) 
in Table 4.1 and a SROTQ(D) knowledge base is now a triple K = (T, A, R), 
where R is regular and simple. R is regular if there is a regular order < 
such that all RIAs in R are <-regular. Essentially, regularity prevents a role 
hierarchy from containing cyclic dependencies. Eventually, simple roles are 
defined inductively as follows: 


e a role is simple if it does not occur on the right-hand side of a RIA in R 
e an inverse role R” is simple if R is, and 


e if R occurs on the right hand side of an RIA in R, then R is simple if, 
for each w E R € R, w = S for a simple role S. 


Then R is simple if all roles occurring in role axioms of the form (A9) — (A15) 
are simple. 
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With sub(K) we denote the set of (sub)concept expressions occurring in 
K. 


4.2.2.4 The €£ Family 


The importance of the EL DL family [13, 18, 178] is due to the fact that 
it is the logical counterpart of the OWL 2 EL profile [341], i.e., OWL 2 EL 
constructs can be mapped into the DL EL**(D). We recall that it enables 
polynomial time algorithms for all the standard reasoning tasks, i.e., the ontol- 
ogy satisfiability problem, the subsumption problem, and the instance check- 
ing problem and, thus, it is particularly suitable for applications where very 
large ontologies are needed, and where expressive power can be traded for 
performance guarantees. 

We next recap succinctly the DL E£ [17] and then extend it to 
ELt*(D) [13, 18]. Specifically, in EL [17] concept expressions are restricted 
to be of the form 


CID > A | (atomic concept) 
T | (universal concept) 
Cap | (concept conjunction) 
3R.C (qualified existential restriction) 


and a TBox consists of definitional GCIs of the form A = C only. 

EL**(D), which extends EL, has been presented in [13] and then further 
extend in [18]. E£+*(D) as from [13], extends E£ by allowing (i) the bot- 
tom concept L, nominal {a}, and concrete concept expressions of the form 
Plf1,..., fn); (ii) GCIs of the general form C E D; and (iii) RIAs. Eventu- 
ally, [18] allows further domain and range restrictions axioms and reflexive role 
axioms. Note that transitive role axioms, trans(R), are supported via RR E R. 
Note also that disjointness axioms, disj(C, D), are supported via C M D EL. 

For the sake of ease of presentation, we summarize here EL**(D) as 
from [18] in Table 4.2. 

Eventually, the DL E£* [15, 25] is as EL, except that inclusion axioms 
have a more general form than in EL: namely, general inclusion axioms are of 
the form C E D, where C, D are EL concepts, while role inclusion axioms are 
of the form R,---R, E R, where R; and R are role names. 


4.2.2.5 The DL-Lite Family 


The importance of the DL-Lite DL family [12, 82, 83, 87] is due to the fact 
that it is the logical counterpart of the OWL 2 QL profile [341], i.e., OWL 
2 QL constructs can be mapped into the DL DL-Liter (D), which, we recall, 
was designed so that sound and complete query answering is in LogSpace 
(more precisely, in AC?) with respect to the size of the data (assertions), 
while providing many of the main features necessary to express conceptual 
models such as UML class diagrams and ER diagrams. 
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TABLE 4.2: Syntax of EL**(D). 


Concept expressions: 


ta) 
cnp 
3R.C 
Pida) 
Axioms: CED 
RR ER 
dom(R) EC 
ran(R) EC 
ref(R) 
aC 
(a, b):R 


We next recap succinctly the DL DL-Lite DL family [87]. We start with the 
language DL-Litecore that is the core language for the whole family. Concepts 
and roles are formed according to the following syntax (A is an atomic concept, 
P is an atomic role, and P” is its inverse): 


B — A|AR 
C — B|-B 
R => P|P- 
E — R|-R. 


B denotes a basic concept, that is, a concept that can be either an atomic 
concept or a concept of the form 3R, where R denotes a basic role, that is, a 
role that is either an atomic role or the inverse of an atomic role. Finally, C 
denotes a concept, which can be a basic concept or its negation, whereas E 
denotes a role, which can be a basic role or its negation. Sometimes we write 
=C (resp., ~E) with the intended meaning that =C = —A if C = A (resp., 
E = ~R if E = R), and -C = A, if C = ~A (resp., ~E = R, if E = œR)’. 
Inclusion axioms are of the form 


BEC 


We might include Bı U Ba in the constructs for the left-hand side of inclusion 
axioms and C1 M C2 in the constructs for the right-hand side. In this way, 
however, we would not extend the expressive capabilities of the language, 
since these constructs can be simulated by considering that Bı U B2 E C is 
equivalent to the pair of assertions B E C and Ba E C, and that BE C¡MCa 


is equivalent to Bı N C1 and B E C4. Similarly, we might add L to the 
constructs for the left-hand side and T to those for the right-hand side. 


30f course, for any interpretation Z, (~R)? = AZ x AT \ R7. 
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DLR-Lite DLR-Lite 
Fall RN 


DLR-Lite 
core 


FIGURE 4.1: Excerpt of the DL-Lite family. 


Assertion axioms are, as usual of the form a:A and (a, b):P. Note that an 
assertion a:C can be simulated in DL-Litecore by adding A E C to the TBox, 
and a:A to the ABox, where A is a new atomic concept. Similarly, assertion 
axioms of the form (a,b):E can be simulated, with the same mechanisms, in 
those extensions of DL-Litecore that allow for inclusion axioms on roles (see 
below). 

DL-Litegr is now obtained by extending DI-Litecore with the ability of 
specifying inclusion axioms between roles of the form 


RCE. 


where R and E are defined as above, DL-Lite is obtained by extending 
DL-Litecore allowing the conjunction of basic concepts on the left-hand side 
of concept inclusions, and DL-Litez is obtained by extending DL-Litecore 
with global functional roles. Worth mentioning is also DLR [19, Chapter 16] 
and the DLR-Lite family [83, 84, 85, 86, 89], which is the extension of DL- 
Lite family to the case in which the predicates are no more unary (concepts) 
or binary (roles), but n-ary in general (see Figure 4.1 for an excerpt of the 
DL-Lite family). For a more detailed description of the DL-Lite family we 
refer the reader to [12]. 

We refer to assertions of the form Bı E Ba (resp. Ri E Ra) as positive 

inclusion assertions (PIs), and to assertions of the form Bı E Ba (resp. 
R, E Ra) as negative inclusion assertions (NIs). 
Note also that in fact, DL-Liter might be enhanced with the capability 
of managing qualified existential quantification on the right-hand side of in- 
clusion assertions on concepts [83]. This construct, however, can be simulated 
by suitably making use of inclusions between roles and unqualified existential 
quantification of concepts in inclusions between concepts (see Appendix B). 
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4.2.2.6 The Horn-DL Family 


The importance of the Horn-DL family [173, 440] is due to the fact that it 
is related to the logical counterpart of the OWL 2 RL profile [341]. Specifically, 
OWL 2 RL constructs can be mapped into a Horn-DL. This is achieved by 
defining a syntactic subset of OWL 2, which is amenable to implementation 
using rule-based technologies. Essentially, the restrictions are designed so as 
to avoid the need to infer the existence of individuals not explicitly present 
in the knowledge base, and to avoid the need for nondeterministic reasoning. 
Here we report the DL specification of OWL 2 RL. Specifically, concepts are 
formed according to the following syntax (A is an atomic concept, m € (0,1), 
lis a value of the concrete domain, R is an object property, a individual, T is 
a datatype property): 


B => A | {a} | BM Bə | B¡UB2|3R.B | 3T.d 

C —+ A|C¡MC¿|-B|VWR.C |IR{a} | YT.d | 
(<mS.B)(<m8)|(<mTd) 

D => AR.{a} | AT. =| Dı N Da 

R — P|P- 


Axioms have the form 
B 


A 
Rı 
Ri 


MAI 


IIA 


N 


and 


disj(B1, B2) 
dom(R, C) 
ran(R, C) 
dom(T, C) 
fun(R) 
irr(R) 
sym(R) 
asy(h) 
trans( R) 
disj(Rı, Rə) 


Assertion axioms are as for SROTQ(D), except that for in concept assertions 
the concept has to be of the form C above: i.e., 


a:C, (a, b):R, (a, b):=R, (a, v):T, (a,v): T . 


For completeness, Appendix B describes OWL 2 and its profiles and the map- 
ping to the appropriate DL. 
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4.3 Conjunctive Queries 


As for RDFS, we introduce the notion of conjunctive query (see Sec- 
tion 3.3). 

A conjunctive query is, similarly to RDFS (see Equation (3.1)) a rule-like 
expression of the form 


q(x) — dy.p(x, y) (4.2) 


where now the rule body (x,y) is a conjunction* of unary or binary predi- 

cates P;(z;) (| <i < n), where P; is either an atomic concept A or an atomic 

role R, and z; is a vector of distinguished or non-distinguished variables. If P; 

is an atomic concept (resp., a role) then z; is unary (resp., binary) tuple. 
For instance, by referring to Example 8, 


q(x, y) — AdultPerson(x), age(x, y) 


is a conjunctive query, whose intended meaning is to retrieve all adult people 
and their age. 

Given a vector x = (21,...,Y%k) of variables, a substitution over x is a 
vector of individuals t replacing variables in x with individuals. Then, given a 
query q(x) + 3y.p(x, y), and two substitutions t, t’ over x and y, respectively, 
the query instantiation p(t, t’) is derived from y(x, y) by replacing x and y 
with t and t’, respectively. Note that a query instantiation y(t, t’) is an ABox. 

We adopt here the following notion of entailment. Given a knowledge base 
K, a query q(x) + 3y.p(x, y), and a vector t of individuals occurring in K, 
we say that q(t) is entailed by K, denoted K = q(t), if and only if there is a 
vector t’ of individuals occurring in K such that in any model Z of K, Z isa 
model of any atom in the query instantiation y(t, t’). 

If K = q(t) then t is called a answer to q. We call these kinds of answers 
also certain answers. The answer set of q w.r.t. K is defined as 


ans(K, q) = {t | K E alt); - 


Remark 6. Note that there is a subtle difference with the usual definition of 
answer, stated as follows: 


Given a knowledge base K, a query q(x) — Jy.p(x,y), and a 
vector t of individuals occurring in K, we say that q(t) is entailed 
by K, denoted K |= q(t), if and only if for any model T of K, T is 
a model of the FOL formula Fy.p(t, y), according to the standard 
definitions of first-order logic. We denote the answer set according 
to this definition as ans*°"(K, q). 


4 Again we use the symbol “,” to denote conjunction in the rule body. 
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The following example shows the difference among our definition of answer set 
and the usual one according to Remark 6. So, consider the simple knowledge 
base 


K = ((a,c):R,b:3Ry 


and the conjunctive query 


Then it is easily verified that 


ans(K,q) = {a} 
ans¥OL(K,q) = {a,b}. 


The difference is due to the fact that in our case the instantiation of the non- 
distinguished variables has to be known independently from any model of K, 
while according to Remark 6, for any model Z of K, the non-distinguished 
variables y may be substituted with domain elements depending on Z. Of 
course, 

ans(K, q) C ans®°"(K, q) 


holds. 

In general, our definition is amenable to a more efficient implementation 
and typically the former has lower computational complexity than the latter. 
In fact, while determining whether a € ans(K, q) inherits the computational 
complexity of the complexity of entailment in the underlying DL (assuming 
the size of q bounded by the size of K), this may not be the case under Re- 
mark 6, e.g., determining whether a € ans" O*(K, q) for ALCT is 2ExpTime- 
complete [296] (see, e.g., [88, 145, 334, 335, 337, 336] for other results), while 
entailment can be decided in ExpTime [442]. 

Specifically, note that a simple procedure to determine the answer set 
ans(K, q) consists in computing off-line the instances of all atomics concepts 
and roles occurring in K, store them into a relational database DBx, convert 
q into and SQL query qsqL and submit gqsgr. As qsqL can be determined 
in LogSpace, determining whether a € ans(K, q) is in LogSpace”, where C is 
the complexity of checking entailment in the underlying DL. 

We conclude by defining a disjunctive query q as usual as a finite set of 
conjunctive queries in which all the rules have the same head. Intuitively, the 
answers to a disjunctive query are the union of the answers of the conjunctive 
queries. That is, for a disjunctive query q = {q1, . - - , qm}, K entails q (denoted 
K Eq) iff K [= qi for some q; € q. 
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4.4 Reasoning 


There are various reasoning problems to be addressed within DLs. The 
typical ones are resumed here (see, e.g., [19]). 


Consistency problem: 
e Is K satisfiable? + Is there some model Z of K? 
e Is C coherent? ++ CF 4 Ú for some model Z of K? 
Subsumption problem: 


eL =CED?»> Is it true that C? C D for all models Z of K? 


Equivalence problem: 


eKEC=D? Is it true that C? = D for all models Z of K? 


Instance checking problem: 


e KEa:C ? + Is it true that a? e CZ for all models Z of K? 


Instance retrieval problem: 


e Compute the set {a | K H a:C} 


Typically, all the above problems can be reduced to the knowledge base sat- 
isfiability problem as long as the below presented reductions are supported 
by the underlying DL language (if not, then specific algorithms have been 
developed). Indeed, we have 


e C is coherent w.r.t. K iff KU {a:C} is satisfiable, where a is a new 
individual; 


eKECE Diff KU {a:CN =D} is not satisfiable, where a is a new 
individual; 


eKEC=DiffeK ECC DandKEDCLCC; 


e KE a:C iff KU {a:7C} is not satisfiable. 
On the other hand, in order to determine whether 
KE (a,0):R , 
we may reduce it to the instance problem, as 


K E (a,b):R iff KU {b:B} Ea: 


RB, 


where B is a new concept. 
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4.4.1 The Case of the AL Family 


We succinctly describe here the major reasoning frameworks within the 
AL family, namely tableau algorithms. To do so, we describe a tableau al- 
gorithm deciding the knowledge base satisfiability problem for the DL ALC 
(see [19, 209, 206] and the paper [371] that originally has introduced this 
method), which is sufficiently expressive to illustrate the main characteristic 
of the tableau method. 

Let K = (T, A) be an ALC knowledge base. At first, we transform any 
concept in K into negation normal form (NNF), i.e., negation occurs only 
in front of atomic concepts. To do so, we push all negation signs as far as 
possible into the description, using de Morgan’s rules and the usual rules for 
quantifiers: that is, we iteratively apply the following transformation rules 


==C => C 

A(CND) > CUAD 

=(CUD) + -Cn=D (4.3) 
=3R.C > VRAC 

AVR.C = IRC. 


With nnf(C) we denote the negation normal form of C, obtained by applying 
the rules above. 

Then, we try to construct a (finite) model Z of K via a tableau algorithm. 
Essentially, the tableau algorithm is a terminating algorithm that, starting 
from an ABox, tries to build a clash-free forest of trees (called completion- 
forest). If it succeeds then K is satisfiable and from the forest a model can be 
build. Otherwise, K is not satisfiable. 


4.4.1.1 The Case with Empty TBox 


Let K be a KB in NNF with empty TBox. A completion-forest F for K 
is a collection of trees whose distinguished roots are arbitrarily connected by 
edges. Each node v is labelled with a set L(v) of concepts C € sub(K). The 
intuition here is that v is an instance of C. Each edge (v, w) is labelled with 
a set L((v, w)) of roles R occurring in K indicating that (v, w) and instance 
of R. 

If nodes v and w are connected by an edge (v, w) with R € L((v, wY) then 
w is called an R-successor of v and v is called an R-predecessor of w. As usual, 
ancestor is the transitive closure of predecessor. 

For a node v, £(v) is said to contain a clash iff {A,7A} C Liv). A 
completion-forest is called clash-free iff none of its nodes contain a clash; 
it is called complete iff none of the expansion rules in Table 4.3 is applicable. 

Now, the algorithm initializes a forest F as follows: 


e F contains a root node vj, for each individual a; occurring in A; 


e F contains an edge (vj, vj), for each assertion axiom (a;,a;):R € A; 
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TABLE 4.3: The tableau rules for ALC with empty TBox. 


— 


). If (î) Ci g Ca = L(v) and (îi) {C1, Co} Z L(v), then add Ci and Ca to 
L(v). 

( ). If (1) Ci U Ca € L(v) and (it) 10, Co} N L(v) = 0, then add some 
Ce 101, C2} to L(v). 

(V). If (i) VR.C € L(v) and R € L((v,w)) with C g L(w), then add C to 
L(w). 

3). If (i) IRC € L(v) and (ii) there is no R € L((v,w)) with C € L(w), 

then create a new node w, add R to £((v, w)) and add C to L(w). 


m 


e for each assertion a;:C € A, we add C to L(vj); 


e for each (a;,a;):R € A, we add R to L((vġ, 9)). 


Then the completion-forest F is then expanded by repeatedly applying the 
completion rules described in Table 4.3 and answers “K is satisfiable” iff the 
completion rules can be applied in such a way that they yield a complete and 
clash-free completion-forest. Note that the only non-deterministic rule is (U). 

We point out that it is relatively easy to build a model Z from a complete 
and clash-free completion-forest. Informally, 


e the domain of Z are the nodes of the forest; 
e the interpretation of individual a; is v; 

e if Re L((v, w)), then (v, w) € R7; 

e if A € L(v), then ve A”. 


Now, termination, soundness, and completeness of the algorithm have been 
shown. 


Proposition 10. For each knowledge base K = (0, A), 
1. the tableau algorithm terminates; 


2. if the expansion rules can be applied in such a way that they yield a 
complete and clash-free completion-forest, then K has a finite model; 


3. if K has a model, then the expansion rules can be applied in such a way 
that they yield a complete and clash-free completion-forest for K; 


4. the KB satisfiability problem is PSpace-complete [24, 16]. 
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(VR.AM3R.B,VR.A, 3R.B) 


a 


(A) b x {A,B} 


FIGURE 4.2: Clash-free complete completion-forest from ABox. 


Example 9. Consider 
K = {aNR.ANAR.B, (a,b):R, (a,c):P} . 


K is satisfiable as there is a clash-free completion-forest (see Figure 4.2). 
The model build from the forest is: 


e A? = {a,b,¢,2}; 

e the interpretation of individuals is the identity function; 
e RY = {(a,b), (a,2)}, P7 = (ac); 

e A7 = {b,x}, B? = {x}. 


4.4.1.2 The Case of Acyclic TBox 


In this section, we show how to extend the tableau calculus if we restrict the 
form of a TBox to so-called acyclic TBoxes. Specifically, let 7 be a Tbox in 
which the GCIs have one of the following forms 

AEC 


A= C 


where A is a concept name, C is a concept. That is, the TBox consists only 
of primitive and definitional GCIs. 

We say that A is the head of these axioms and C is the body. Furthermore, 
we also assume that no concept name A is in the head of more than one axiom. 
This further restriction allows to avoid that a GCI C E D may be introduced 
indirectly, e.g., via 


A = 
B = 


ua 
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Now, we say that 


e concept name A directly uses concept name B w.r.t. T, denoted A >y 
B, if A is the head of some axiom E € 7 such that B occurs in the body 
of E; 


e concept name A uses concept name B w.r.t. T, denoted A ~y B, if 
there exist concept names A4,..., An, such that A, = A, A, = B and, 
for every 1 < i < n, it holds that A; >r Aj+1. 


Eventually, we say that a TBox 7 is cyclic (acyclic) if there is (no) A such 
that A ~y A. 

An interesting feature of an acyclic KBs is that an acyclic T'Box is elimi- 
nated by systematically replacing any defined atom by means of its definition 
(this procedure is called unfolding) and, then we may apply the reasoning 
method for a KB with empty TBox (see Section 4.4.1). Note, however, that 
the unfolding method is inherently intractable as the unfolded KB may be- 
come of exponential size [326]. 

We also present here an alternative method, called lazy unfolding [20, 19, 
202], which it has been shown to be more efficient and that does not require 
the unfolding step. To do so, we first replace any axiom A E C € T with 
A = CN A, where A’ is a new concept name. Then, we extend out calculus 
in Table 4.3 with the rules in Table 4.4. 


TABLE 4.4: The tableau rules to deal with acyclic TBox. 


(=1). H(i) A=CeET, (îi) A € L(v), and (iii) C g L(v) then add C to L(v). 


(=>). If (i) A=CeT, (ii) AA € L(v), and (iii) nnf(AC) g L(v) then add 
nnf(=C) to L(v). 


The construction of a model Z from a complete and clash-free completion- 
forest is as for the empty TBox case. 
As before, it can be shown that 


Proposition 11. For each knowledge base K = (T, A) with acyclic T, 
1. the tableau algorithm terminates; 


2. if the expansion rules can be applied in such a way that they yield a 
complete and clash-free completion-forest, then K has a finite model; 


3. if K has a model, then the expansion rules can be applied in such a way 
that they yield a complete and clash-free completion-forest for K. 


4. the KB satisfiability problem is PSpace-complete [24, 16]. 
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(VR.ANR.B,VR.A, 3R.B) a E A 


{A} IN {A,B,C} 


FIGURE 4.3: Clash-free complete completion-forest for acyclic KB. 


Example 10. Consider the KB as in Example 10 augmented with the GCI 
B=C. 


K is satisfiable as there is a clash-free complete completion-forest (see Fig- 
ure 4.3). 
The model build from the forest is: 


e AT = {a,b,c £}; 

e the interpretation of individuals is the identity function; 
+ RF = {(a,b), (a,2)}, P? = {(a,0)}; 

e At = {b,x}, Bt = {a}, OF = {x}. 


4.4.1.3 The Case with General TBox 


Eventually, we show here how we may deal with the case in which the 
GCIs in the TBox are of the general form C E D. At first, note that we 
may assume, without loss of generality (w.l.o.g.), that any GCI is of the form 
TE D as CCE Dis equivalent to T E =C U D. Now, let us consider the rules 
in Table 4.3 extended with the rule for GCIs in Table 4.5. 


(E). If (1) TE DET and (ii) D g L(v), then add D to L(v). 


While the calculus is still sound and complete, termination is not guaranteed 
as shown by the following example. 


Example 11. Consider the KB 


K = {a:A, AC R.A} 


It can readily be shown that there are infinitely many applications of the (3) 
and (C) rules, building an infinite completion-forest (tree). 
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w L(w) 


FIGURE 4.4: Blocking in ALC. 


To cope with the non-termination problem, the notion of blocking has been 
introduced. 

Specifically, a node v is directly blocked iff none of its ancestors are blocked 
and there exists an ancestor w such that? 


L(v) = L(w) . 


In this case we say that w directly blocks v. A node v is indirectly blocked ¡ff 
one of its ancestors are blocked. Finally a node v is blocked iff it is not a root 
node and it is either directly or indirectly blocked (see Figure 4.4). 

Now, the calculus is as described above except that the rules are not applied 
to blocked nodes. Specifically, the rules are described in Table 4.5. 

Note that a model Z from a complete and clash-free completion-forest is 
built informally as follows 


e the domain of Z are the nodes of the forest that are not blocked; 
e the interpretation of individual a; is 4); 
e if A € L(v), then ve A’; 
e if RE L((v,w)) and v, w not blocked, then (v, w) € R7; 
e if RE L((v,w)) and w blocked by w’, then (v, w’) € R?. 
It can be shown that using the rules in Table 4.5 
Proposition 12. For each knowledge base K = (T, A) 


5'To be more precise, for ALC, the condition L(v) C L(w) suffices. 
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TABLE 4.5: The tableau rules for ALC KBs with GCIs. 


(M). If (i) Cı N Ca € Liv), (îi) (C1,C2) Z Lv), and (iii) node v is not 

indirectly blocked, then add C1 and C2 to L(v). 

(U). If (1) Ci U Co e Liv), (ii) (Cu, Co} N L(v) = 0, and (iii) node v is not 
indirectly blocked, then add some C € {C1, C2} to L(v). 

(V). If (i) YR.C € L(v), (ii) Re L((v,w)) with C ¢ L(w), and (iii) node v 
is not indirectly blocked, then add C to £(w). 

(3). If (i) ARC € Lv), (îi) there is no R € L((v,w)) with C € L(w), and 
(iii) node v is not blocked, then create a new node w, add R to L((v, w)) 
and add C to L(w). 

(C). If @) TE DET, (îi) D g Liv), and (iii) node v is not indirectly 

blocked, then add D to L(v). 


1. the tableau algorithm terminates; 


2. if the expansion rules can be applied in such a way that they yield a 
complete and clash-free completion-forest, then K has a finite model; 


3. if K has a model, then the expansion rules can be applied in such a way 
that they yield a complete and clash-free completion-forest for K; 


4. the KB satisfiability problem is ExpTime-complete [131, 442]. 


Example 12. Consider Example 11. K is satisfiable as there is a clash-free 
completion-forest (see Figure 4.5). Note that node y is blocked by x. 
The model build from the forest is: 


e A? = {a,x}; 

e the interpretation of individuals is the identity function; 
e RI ={(a,z), (2, 1)); 

e A? = {a,z}. 


It is not our purpose to present here the tableau rules for more expressive 
languages of the AL family, e.g., for SHOZN, SROTQ. We refer the reader 
to e.g., [203, 206, 207]. However, let us note that besides adding a rule for each 
construct, the blocking condition is extended as well to a more sophisticated 
definition, as e.g., SHZF does not employ the finite model property anymore. 

Nevertheless, as an illustrative example, in Appendix C we recall the 
tableau calculus for SHTF,. 
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FIGURE 4.5: Clash-free completion-forest from ALC KB. 


4.4.1.4 A Classification Algorithm 


For completeness, we provide here also the typical ontology classification al- 
gorithm for the A£-family. The algorithm is as from [20, 22, 23]. For a more 
recent account on it, see [170]. 

We first introduce some definitions. For some pre-order < (reflexive, tran- 
sitive relation) on some set P, let < be the precedence relation of <, i.e., < 
is the smallest relation such that its reflexive, transitive closure is <, except 
for pairs of “equivalent” objects, i.e., where a < band b < a holds. Obviously, 
x < y iff x < y and there is no z different from x and y such that x < z < y. 
The intuition here is that x < y means y subsumes x, while x < y means that 
y subsumes x and there is no other z subsumed by y and subsuming x. < is 
intended to represent the subsumption relation and ~ is intended to represent 
the hierarchy of concept names in 7, which we want to compute. If £ < y 
then x is a successor of y and y is a predecessor of x. Similarly, if x < y then 
x is an immediate successor of y and y is an immediate predecessor of x. 

Note that as < (subsumption) is a pre-order, there may be two distinct 
concepts c,c’, such that c < e! and e! < c holds, i.e., c and c’ are logically 
equivalent. As it cannot be c < € < c, we will assume that for any two 
equivalent concept names c,c’, only one is considered to occur in <. 

A concept name hierarchy < is represented as a directed acyclic graph 
where a node is labelled with a set of logically equivalent concept names and 
an edge corresponds to the immediate predecessor relation, t.e., there is a link 
from node va to node vı if x < y and x and y are in the label of vı and vz, 
respectively. 


A simple classification algorithm. The algorithm is incremental. Suppose 
we want to classify the concept names in X. Assume we have determined the 
precedence relation <; for a set X; C X of concept names. Initially, Xy = 
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IL, TY and L <o T. Consider a concept name c € XA X;. We determine a 
method to compute the precedence relation <;¿+1 for set X;41 = X;U{c}. To do 
so, we compute c’s immediate predecessors (procedure topSearch), PX£, and 
immediate successors (procedure bottomSearch), SXF, w.r.t. concept names 
in X;. Let 

PX; = (reX;|c=x) 

SX; = {xe xX, |o=<c). 
Given the set PXf and SXf, and ~<;, it is possible to compute <a on 
Xi41 = X; U {c} in linear time. In fact, one just needs to add c < x for 
ze PX£,x~<c for ze SXF, and eventually remove x < y, for x € SXF and 
y € PXS, ie., 

<im = (ulcere PXP}U{a xc] ae SXF} 
\{a@~,y|a E SXF, yE PX). 


Specifically, we compute 


TX; = {xe Xi|c< zand c É y for all y x; x} (4.4) 
BX; = f[xeX,|r<cand y Z c foral x <; y}. (4.5) 
TXF and BXf are almost PX? and 5X7, respectively. There is only a special 
case if cis equivalent to some x € Xj, t.e.,¢ < x and x < c. In that case, PXf 
is empty and TXF = {x} contains the concept x with c < x. In this case, we 
test x < c. If the test is positive, c is equivalent to x and it suffices to add c 


to the node label containing x. 
The procedure SimpleClassify(K) is as follows [20, 22, 23]: 


SimpleClassify (X) 


1. Let 
(a) X be the set of concept names in 7 
(b) Xo := {L, T} and L <o T 

IE (X \ X;) =0 return (X;, <;) and exit 

„Select c € XA X; 

. Compute TX; 

„If TXF = {x} and z < c then add c to the node label containing 
x, and go to step 2. 

. Compute BA? 

7. Set 


OP WwW N 


aD 


Xi+1 = X; U {c} 
Sim = (<¿Ule=x|reTX UL: Xc| x © BX¿)) 
\{a~:,y|uce BX; ye TĂI. 


| 
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8. Increment ¿ and go to step 2. 


We next show how to compute TX? and BĂf. We first address T'Xf. The 
other case is dual. 

To start with, we assume that the elements in X; are represented as a 
concept hierarchy, as explained previously. To determine TA, our procedure, 
called topSearch, starts with T € X, and visits the concept hierarchy X, in a 
top-down breadth-first fashion. That is, 


TX; := topSearch(c, T) . 


For each concept name x € X; under consideration, it determines whether x 
has an immediate successor y satisfying c < y. If there are such successors, 
they are considered as well. Otherwise, x is added to the result list of the 
topSearch algorithm. 

To avoid multiple visits of elements of X; and multiple comparisons of 
the same element c, topSearch employs the label “visited” and another label 
“positive” if the subsumption test has been made. The procedure topSearch 
gets two concepts as input, the concept c, which have to be inserted and the 
element x € X; currently under consideration. For x, we already know that 
c < x and we look at direct successors of x w.r.t. <;. For each direct successor 
y of x, we have to check whether y subsumes c. This is done with the procedure 
enhancedTopSubs. The direct successors for which the test was positive are 
collected in a list PosSucc. If the list remains empty, x is added to the result 
list; otherwise topSearch is called for each positive successor, if not already 
visited. 

The topSearch(c, x) algorithm is as follows: 


topSearch(c,x) 


1. Visited(c, x):= true 


2. For all y with y =; x do: if enhancedTopSubs(y,c) then 
PosSucc := PosSuccU {y} 


3. If PosSucc = Ú then Result := {x} and go to step 5. 


4. For all y € PosSucc do: if not Visited(c, y) then Result := ResultU 
topSearch(c, y) 


5. Return Result. 
The enhancedT opSubs(y, c) algorithm is as follows: 


enhanced TopSubs(y,c) 


1. if SubsFlag(y, c) = “positive” then Result := true and go to step 
5. 


2. If SubsFlag(y, c) = “negative” then Result := false and go to step 
5. 
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3. If for all z with y =; z it holds that enhancedTopSubs(z,c) and 
c < y then SubsFlag(y, c) := “positive” , Result := true and go to 
step 5. 

4. SubsFlag(y, c) := “negative”, Result := false 


5. Return Result. 


Note that in step 3, before testing the subsumption c < y we apply the 
following heuristics using negative information [20, 22, 23]: 


If for some predecessor z of y the test c < z has failed, we can 
conclude that c £ y without performing the expensive test c < y. 
To gain maximum advantage, all predecessors of y should have 
been tested before the test is performed on y, which is obtained 
by recursive calls. 


(20, 22, 23] address also the use of positive information: before checking c < y, 
check if there is some successor z of y that has passed the test c < z and in 
this case c < y holds without performing the expensive test c < y. However, 
(20, 22, 23] show that the use of negative information is much better than the 
use of positive information. 

We next show how to compute BX¥, which is dual to the topSearch pro- 
cedure and is performed by the bottomSearch algorithm. We have that 


BX; := bottomSearch(c, L) 
bottomSearch(c,x) 


1. Visited(c, x):= true 


2. For all y with x ~<; y do: if enhancedBottomSubs(c, y) then 
PosPrec := PosPrecU {y} 


3. If PosPrec = @ then Result := {x} and go to step 5. 


4. For all y e PosPrec do: if not Visited(c, x) then Result := ResultU 
bottomSearch(c, y) 


5. Return Result. 
The enhanced BottomSubs(c, y) algorithm is as follows: 


enhancedBottomSubs(c,y) 
1. if SubsFlag(c, y) = “positive” then Result := true and go to step 


2. If SubsFlag(c, y) = “negative” then Result := false and go to step 
5. 

3. If for all z with z <; y it holds that enhancedBottomSubs(c, z) 
and y < c then SubsFlag(c, y) := “positive” , Result := true and 
go to step 5. 
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FIGURE 4.6: A DL classification run. 
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4. SubsFlag(c, y) := “negative”, Result := false 
5. Return Result. 


Note that in step 3, before testing the subsumption y < c we apply the 
following heuristics using negative information [20, 22, 23]: 


If for some successor z of y the test z < c has failed, we can 
conclude that y £ c without performing the expensive test y < c. 
To gain maximum advantage, all successors of y should have been 
tested before the test was performed on y, which is obtained by 
recursive calls. 


Example 13 ([170]). Consider the KB 


K = {C C JR.D, IRTE). 


Figure 4.6 illustrates a classification run. 


Optimizing classical classification via told subsumers. Consider a KB 
K = (T, A). We introduce the notion of told subsumer [19]. 


1. If 7 contains A E C or A = C then C is called a told subsumer of A, 
denoted A >, C. 


2. If DE C,N...NC, € T then for all i, D— Ci. 
IfA=C,M...MC, € T then for all i, A 4, Ci. 
If C,U...UC, E DET then for all i, C; 34, D 


If A=C,U...UC, € T then for all i, Ci 34, A. 


See O 


. ts is transitive 
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Now, we may exploit the notion of told subsumers by applying the following 
heuristics: 


Remark 7. Before classifying concept A, all of its told subsumers which have 
already been classified, and all their subsumers, can be marked as subsumers 
of A. Subsumption test for these atoms are, thus, unnecessary. That is, in the 
procedure SimpleClassify, we have another initialization step: 


if A >s B then set SubsFlag(B, A) = ‘positive’. 


To maximize the effect of told subsumer optimization, atoms should be classi- 
fied in definitional order. This means that a concept A is not to classify until 
all of its told subsumer have been classified. Specifically: 


Remark 8. In procedure SimpleClassify, in step 3, select concept c € XX; 
only if all of its told subsumer have been classified. That is, 


if A >ts B then classify B before A. 


Therefore, we get a preference order on the atoms in X. 


The acyclic KBs case. Consider an acyclic KB K = (T, A). For acyclic 
KBs we may take advantage of the heuristics developed previously, concerning 
told subsumers. Specifically, apply the SimpleClassify procedure with the 
two heuristics described in Remark 7 and 8. Then the ordering induced by 
Remark 8 can be exploited by omitting the bottom search phase for primitive 
concept names and assuming that they only subsume (concepts equivalent) 
to L. This is possible because, with an acyclic KB, a primitive concept can 
only subsume concepts for which it is a told subsumer. Therefore, as concepts 
are classified in definition order, a primitive concept will always be classified 
before any of the concepts it subsumes (and, thus, the bottom search phase 
is redundant). 

Note that this additional optimization cannot be used with a general KB 
because, in the presence of general GCIs, it can no longer be guaranteed that a 
primitive concept will only subsume concepts for which it is a told subsumer. 
E.g., for 


T ={ACIARC,AR.CCB}, 


B is not a told subsumer of A, but B subsumes A. 


4.4.2 The Case of the €£ Family 


For the sake of ease of presentation, we provide here the calculus for 
EL** (we refer the reader to [13, 18] for €L**(D) and its extensions). Essen- 
tially, we drop from €£**(D) the concrete concept expressions of the form 


p(f,. os fa): 
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We have seen that for the AL-family the classification algorithm is built 
on top of the subsumption test algorithm. The main feature of the reasoning 
algorithm of E L** is instead the fact that it is a polynomial-time subsumption 
algorithm that actually directly classifies a given TBox 7, i.e., simultaneously 
computes all subsumption relationships between the concept names occurring 
in 7, which makes it particularly useful for large size ontologies. 

Note also that the other decision problems can be reduced to the subsump- 
tion problem as well [14]. Let K = (A, T) be an EL** KB. Then 


e C is coherent iff K | a:C, for a new individual a; 


e XK is satisfiable iff K j a:1; 


e Concerning the instance problem, we convert an ABox A into a concept 
Cu as follows 


Ca:= [|] 2U(anon []  3U.((a) Mar.) 
a: Aca (a,b):Rea 


where U is a new role not occurring in A. Then 


KE aC iff TE {a}NC,EC. 


So, in the following we consider the subsumption problem only w.r.t. a TBox 
T 


The algorithm proceeds in four steps: 

1. Normalise the TBox. 

2. Translate the normalized TBox into a graph. 
3. Complete the graph using completion rules. 


4. Read off the subsumption relationships from the normalized graph. 


Remark 9. Please, note that [243, 244] has shown that the TBox classifica- 
tion calculus for ELF*, as illustrated in [13, 18], is incomplete in the presence 
of nominals in the TBoz, e.g., given the TBoz [229], (but see also [244] for 
other examples) 


T =4 ACARA(BN {o}) 


AE 35.10) 
35BCB), 


this TBox entails A E B, because if A is not empty then o is an instance of 
B, but the algorithm in [13, 18] is not able to infer it (the problem is in the 
(CR6) illustrated later on). 
For further insights we refer the reader to [229, 230, 243, 244, 245, 297], 
and specifically to [230, 229] for a complete and similar to [13, 18] calculus. 
However, even if incomplete, it is still illustrative to present the work as 


from [13, 18]. 
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TABLE 4.6: NNF transformation rules for E£** TBoxes. 


NF1 R¡o--*-oR. ES > {Ri o---oR,1 CU, OR, LS) 


NF2 CONDEE>S(DEACNALE) 


NF3 JR.CO DH {CL A,IR.AEL D} 


NF4 1ED>0 


E 


NF5 ČC DH {CL A, AC D} 
B 


NF6 


JR. (BE 3R.A, ALC) 


NF7 BCOCND4{BLC,BLD} 


where C, D ¢ By, U new role name, and A new concept name. 


So, let By to denote the smallest set of concept descriptions that contains 
the top concept T, all concept names used in 7, and all concept descriptions 
of the form {a}. 

We say that a T is normalized if 


1. it only contains GCIs of the following form: 


ACB, A¡MA¿CB, ACARB, ARACB 


where A, A; € By and B € BrU{Ly}; 


2. all role inclusions are of the form RES or Ryo Ro LS. 


By introducing new concept and role names, any TBox 7 can be turned into 
a normalized TBox 7” that is a conservative extension of T, i.e., every model 
of T’ is also a model of 7, and every model of T can be extended to a model of 
T’ by appropriately choosing the interpretations of the additional concept and 
role names and it is shown that this transformation can actually be done in 
linear time [14]. The conversion into normal form is done using the translation 
rules shown in Table 4.6 in two phases: 


1. exhaustively apply rules (NF1) — (VF4);* 
2. exhaustively apply rules (NF5) — (NF7). 


So, let us now assume that 7 is normalized. When developing the subsumption 
algorithm for normalized E£*+ TBoxes, we can restrict our attention to sub- 
sumption between concept names. In fact, 7 = C E Diff T’ H AC B, where 


6Note that in phase 1, the rule (NF 2) is applied modulo commutativity of conjunction. 
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T’' =TU{AEC,DC B}, with A,B new concept names. The subsumption 
algorithm not only computes subsumption between two given concept names 
w.r.t. the normalized input TBox 7; it rather classifies 7, i.e., it simultane- 
ously computes the subsumption relationships between all pairs of concept 
names occurring in 7. 

So, let Ry to denote the set of all role names used in 7. The algorithm 
computes a mapping S from By to a subset of Br U{T, LI, and a mapping 
R from Ry to a binary relation on By. The intuition is that these mappings 
make implicit subsumption relationships explicit in the following sense: 


e De S(C) implies that T |= C E D; 


e (C, D) € R(P) implies T EC E AP.D. 


These mapping are initialized as follows: 
e S(C) := {C,T} for each C € Br; 
e R(P) :=0 for each Pe R7. 


Then the sets S(C) and R(P) are extended by applying the completion rules 
shown in Table 4.7 until no more rule applies. Some of the rules use ab- 
breviations that still need to be introduced. Indeed, (CR6) uses the rela- 
tion RC Br x Br, which is defined as follows: C ~>r D iff there are 
C1,..., Ck € By such that 


e Cı = C or Cı = {b} for some b; 
e (Cj, Cj+1) E R(R;) for some Rj € Ry (1< j< k); 
e C =D. 

The following can be shown [13]. 

Proposition 13. For a normalized TBox T, 


1. the rules of Table 4.7 can only be applied a polynomial number of times, 
and each rule application is polynomial; 


2. Let S be the mapping obtained after the application of the rules of Ta- 
ble 4.7 to T has terminated, and let A, B be concept names occurring in 
T, then T KE ACB if” one of the following two conditions holds: 


e S(A)N{B, 1} 4 0; 
e there is {a} € Br such that Le S({a}). 


We refer the reader to [293] for a conjunctive query answering procedure for 
EL. 


“The other direction does not hold as from Remark 9, but holds if nominals are missing 
in the TBox. 
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TABLE 4.7: Completion rules for EL**. 


(CR1) If C e S(C),C'E DET, and D ¢ S(C) then add D to S(C); 

(CR2) If C,,C2 € S(C),C¡ NC E DET, and D ¢ S(C) then add D to 
S(C); 

(CR3) If C” € S(C),C’ CAP.D€T, and (C, D) g R(P) then add (C, D) to 
R(P); 


(CRA) If (C, D) € R(P),D' € S(D),3P.D' E E € T, and E q S(C) then 
add E to S(C); 


(CR5) If (C, D) e R(P), Le S(D), and Lg S(C) then add L to S(C); 
(CR6) If {a} € S(C)NS(D),C ~r D, and S(D) Z S(C) then join S(D) to 
S(C); 


(CRR1) If (C, D) € R(P),P E S € T, and (C, D) ¢ R(S) then add (C, D) 
to R(S); 


(CRR2) If (C, D) € R(R1),(D, E) € R(R2), Rio Ro E Rz € T, and (C, E) ¢ 
R(R3) then add (C, E) to R(Rg3). 


Example 14. Consider the KB of Example 13, i.e., 


K = {C C 3R.D,3R.T C E}. 


Note that K is already normalized. The following illustrates a classification 
run. 


1. Initialization: 


S(C) := {C,T} 
S(D) := {D,T} 
S(E) := {E,T} 
R(R) = 0. 
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FIGURE 4.7: An EL ** classification example. 


4. Stop. Figure 4.7 illustrates the final classification, which coincides with 
the one in Figure 4.6. 


4.4.3 The Case of the DL-Lite Family 


We recall that DL-Lite-family [12, 82, 83, 87] has been specifically tailored 
to capture some basic ontology language features, while keeping a low com- 
plexity of reasoning. Reasoning means not only computing the subsumption 
relationships between concepts, and checking satisfiability, but also answering 
complex queries (î.e., conjunctive queries) over a huge set of instances. 

In the following, we restrict our attention to DL-Litecore [82], which is 
sufficient to highlight the main concepts for reasoning within the DL-Lite- 
family. 

Consider a DL-Litecore KB K = (T, A) and a conjunctive query q(x) — 
Jy.p(x, y). The main reasoning tasks related to K and q are: 


e Knowledge base satisfiability problem; 

e Subsumption checking; 

e Concept coherence; 

e Instance checking; 

e Query answering, i.e., computing the answer set ans(K, q). 


In the following, define an r-concept as any concept that can occur on the 
right-hand side of a concept inclusion. 

Note that the subsumption problem can be reduced to the KB unsatisfia- 
bility problem. In fact, for a DL-Litecope axiom C E D , in order to determine 
whether K = C E D, consider a new concept name A, a new individual a, 
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and set K’ = (T”, A’), where 


T = TU[ACDC,A 
A := Avf{a:A}. 


=D} 


Then, it is easy to see that K E CE D iff K’ is not satisfiable. 
Determining whether a r-concept C is coherent can be reduced to the 


KB satisfiability problem as follows: consider a new concept name A, a new 
individual a, and set K’ = (T”, A’), where 


T t= SPUI CAE) 
A := Auf{a:A}. 


Then, it is easy to see that C is coherent iff K’ is satisfiable. 
Concerning instance checking, i.e., determining whether K = a:C for r- 
concept C, consider a new concept name A and set K’ = (7', A’), where 


T TU{AL-C} 
A := Avf{a:A}. 


II 


Then, it is easy to see that K - a:C iff K’ is not satisfiable. 
Therefore, it remains to address the KB satisfiability problem and the 
query answering problem. At first, however, we need a KB normalization step. 


Knowledge Base normalization. The normalization of K = (T, A) is ob- 
tained by transforming X as follows. The ABox A is expanded by adding to 
A the assertions a:3R and b:3R”, for each (a,b):R € A. 

Then 7 is expanded by closing it with respect to the following inference 

rule: if Bı E By € T and either Ba E ~B € T or B3 E AB € T then add 
Bı E ~B; to T. It can be shown that after computing the above closure we 
have that T = Bı E Ba iff either Bı E —B> € T or Bp CAB, ET. 
Now we store A in a relational database. That is, (i) for each basic concept 
B occurring in A, we define a relational table tabg of arity 1, such that 
(a) € tabg iff a:B € A; and (i) for each role P occurring in A, we define a 
relational table tabp of arity 2, such that (a,b) € tabp iff (a,b):P € A. We 
denote with DB(A) the relational database thus constructed. 


Knowledge base satisfiability. To check the satisfiability of a normalized 
KB K = (T, A), we verify the following condition: there exists B, E ~B € T 
and a constant a such (a:B,,a:B2) C A. If this condition above holds, then 
K is not satisfiable. Otherwise, K is satisfiable. Note that the algorithm can 
verify this condition by posing to DB(A) a simple conjunctive query expressed 
in SQL query, i.e., K is not satisfiable iff 


q(x) E tabg, (x), tabp, (x) 


has non-empty answer in DB(A). 


Web Ontology Language OWL 67 


Notice that the algorithm does not consider the PIs (positive inclusions) 
occurring in 7 during its execution. Indeed, it can be shown that PIs do not 
affect the satisfiability of a DL-Lite KB, if the TBox is normalized [82]. 

It can be shown that the above described algorithm decides the KB satis- 
fiability problem in polynomial time [82]. 


Query answering. We show now how to determine the answers of a con- 
junctive query over K. To this end: 


1. We have to check if K is satisfiable, as querying a non-satisfiable KB is 
undefined in our case. 


2. By considering 7 only, the user query q is reformulated into a set of 
conjunctive queries r(q, T). Informally, the basic idea is that the refor- 
mulation procedure closely resembles a top-down resolution procedure 
for logic programming, where each inclusion axiom B E Ba is seen as a 
logic programming rule of the form B2(x) — B(x). For instance, given 
the query q(x,s) < A(x) and suppose that T contains the inclusion 
axioms Bı E A and Ba E A, then we can reformulate the query into 
two queries q(x, s) E Bı(x) and q(x, s) — B(x), exactly as it happens 
for top-down resolution methods in logic programming. 


3. The reformulated queries in r(q,T) are then evaluated over A only 
(which is stored in a database), producing the requested answer set 
ans(K, q). 


The query reformulation step is as follows. We say that a variable in a con- 
junctive query is bound if it corresponds to either a distinguished variable or 
a shared variable, i.e., a variable occurring at least twice in the query body 
(inclusive the scoring function), or a constant, while we say that a variable 
is unbound if it corresponds to a non-distinguished non-shared variable (as 
usual, we use the symbol “_” to represent non-distinguished non-shared vari- 
ables). Note that an atom of the form (3P)(x) (resp. (AP )(x)) has the same 
meaning as P(x,_) (resp. P(_,x)). For ease of exposition, in the following we 
will use the latter form only. 

An axiom 7 is applicable to an atom B(x), if r has B in its right-hand side, 
and 7 is applicable to an atom P(x, £2), if either (i) x2 = _ and the right-hand 
side of 7 is SP, or (îi) xı = - and the right-hand side of 7 is P~. We indicate 
with gr(g;7) the atom obtained from the atom g by applying the inclusion 
axiom 7. Specifically, if g = Bi(x) (resp., g = Pi(a,-) or g = Pi(-,x)) and 
T = B E B; (resp., 7 = By CAP, or T = B E AP, ), we have: 


e gr(g,7) = A(x), if By = A, where A is an atomic concept; 


e gr(g,T) = Polo, =); if Ba = JP»; 


e gr(g, T) = Pa(_,1), if B2 =3P, . 
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We are now ready to present the query reformulation algorithm. Given a 
conjunctive query q and a set of axioms 7, the algorithm reformulates q in 
terms of a set of conjunctive queries r(q, T), which then can be evaluated over 


DB(A). 


Algorithm 1 QueryRef (q, T) 


Input: Conjunctive query q, normalized KB K. 
Output: Set of reformulated conjunctive queries r(q, T). 


1: r(q, T) := {a} 


2: repeat 

3: S =r(q,7) 

4: for all q € S do 

5: for all y € q do 

6: if r € T is applicable to g then 

T: r(q, T) := r(q, T) U falg/gr(g, 7)]} 
8: for all g1, g2 € q do 

9: if gı and g2 unify then 


10: r(q, T) := r(q, T) U [x(reduce(g, g1, 92))) 
11: until S = r(q, T) 
12: return r(q,T) 


In the algorithm, q[9/g'] denotes the query obtained from q by replacing the 
atom g with a new atom g’. At step 8, for each pair of atoms ga, ga that unify, 
the algorithm computes the query q' = reduce(q, 91, 92), by applying to q the 
most general unifier between gı and g2.8 Due to the unification, variables that 
were bound in q may become unbound in qg’. Hence, inclusion axioms that 
were not applicable to atoms of q, may become applicable to atoms of q” (in 
the next executions of step (5)). Function « applied to q’ replaces with _ each 
unbound variable in g’. 

The main property of the query reformulation algorithm is as follows. It 
can be shown that 


ans(K,q) = {c | qi E r(q, T), A H a(c)} - 


The above property dictates that the set of reformulated queries q; € r(q, T) 
can be used to find the answers, by evaluating them over the set of instances 
A only, without referring to the ontology 7 anymore. Note, however, that the 
size of r(q, T) may be exponential w.r.t. T. 

It can be shown that the above described algorithm computes correctly 
the answer set of a conjunctive query. 

From a computational complexity point of view, it can be shown that 


Proposition 14 ([82]). Satisfiability of a KB K can be decided in time poly- 
nomial in the size of K. Conjunctive query answering is in LogSpace in data 
complexity, while is NP-complete in combined complexity. 


Proposition 14 extends also to the case of disjunctive queries, to DL-Litez n, 
DL-Liter mn, DLR-Lites n and DLR-Liter n [83]. 


8 We say that two atoms gı = r(x1,...,tn) and g2 = r(y1,...,Yn) unify, if for all i, 
either x; = yi or x; = - or yi = -. If gi and g2 unify, then the unification of gı and g2 is 
the atom r(z1,...,2n), where 2; = x; if x; = y; or yi = -, otherwise z; = y; [80]. 
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Example 15 ([410]). Suppose we have the following information about hotels 
and conferences. 


Accommodation ©  JHasALoc 
Accommodation ©  JHasAPrice 
Conference ©  JHasCLoc 
Hotel [E Accommodation 
Hotelx [© Accommodation 
Hotelxx [© Accommodation 
Hotelxxxxx E. Accommodation 
HasALoc HasCLoc HasHPrice ] 
AID | HasLoc ConfID | HasLoc AID | Price 
al hli ci eli al 150 
a2 |h12 c2 c12 a2 | 200 | 


We may ask to find accommodations, their location, and price. The query may 
be expressed as 


g(a, hl, p) —Accomodation(a), HasHLoc(a, hl), HasHPrice(a,p) . 
Then the following is the set of query rewritings r(q, T): 


q(a,hl,p) + Accomodation(a), HasHLoc(a, hl), HasHPrice(a, p) 
q(a,hl,p) +  Hotel(a),HasHLoc(a, hl), HasHPrice(a, p) 
q(a,hl,p) +  Hotelx(a),HasHLoc(a, hl), HasHPrice(a, p) 


q(a,hl,p) +  Hotelxxxxx(a), HasHLoc(a, hl), HasHPrice(a,p) . 


4.4.4 The Case of the Horn-DLs Family 


A main feature of the Horn-DL family is its close relationship with logic pro- 
gramming and, indeed, reasoning is performed via a translation of Horn-DL 
statements into a logic programming language. In fact, [339] shows how OWL 
2 RL can be implemented using the rule language RIF (Core).? 

For the sake of ease of presentation, we are not going to present the whole 
mapping for Horn-DL, but for a significant subset only that is sufficient to 
illustrate the main idea behind this translation. The Horn-DL language we 
consider here is 


B => A|B, Bə | Bı U Bə | 3R.B 


where inclusion axioms have the form 


9A partial mapping is provided also in [341]. 
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IS 


Assertions are of the form a:C, (a, b):R. 

Our mapping follows [173]. In the following, a rule [263] is as a conjunctive 
query with the exception that an unary or binary predicate occurs in the rule 
head. A fact is a rule F — with empty body. Facts are denoted F as well 
(omitting the symbol —).*% 

We now define a recursive mapping function o which takes inclusions ax- 
ioms and assertions and maps then into the following expressions: 


O(R E Re) > Orole(Ra,t,Y) — Orole(R1, x,y) 
Trole(R,t,Y) +> R(z,y) 
or(R”,z,y) +> Ry,2) 
o(BEC) > op(C,x) < ov(B) 
on(A,xz) > A(x) 
on(C1 N C2,£) > on(Ci, £) A on(Co, x) 
on(VR.C,x2) > on[C,t) E Oroie(R, x,y) 
ol A,x) = A(x) 
(CNC) > lC, £) A (C2, x) 
on(Cy UC2,£) > olC1, 2) V (C2, x) 
ol IR.C, £) => Orote(R, £, Y) A lC, y) 
o(a:0) => onl(C,a) = 
olla, b): R) = GroielR, a,b) = 


where y is a new variable. 
We then transform the above generated expressions into rules by applying 
recursively the following mapping: 


or((H^A H')+ B) > o,(H + B), o, (H' + B) 


or((H + H) B) >» o,(H &(B^H')) 
o,(H ~ (Bı V B2)) E or (H = Bı), o,(H —= Ba) 


Eventually, if none of the above three rules can be applied then 


(HB) > HB. 


10See Chapter 5. 
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For instance, the GCI 


ANAR.CC BOVP.D 


is first mapped into the expression (via 0) 
B(x) A (D(x) — Plz,z)) — A(x) A R(x, y) AC(y) 


that then is transformed into the two rules (via 0,) 


It can be shown that 


Proposition 15 ([173]). The above described transformation preserves se- 
mantic equivalence. That is, let K be a KB and Px be the rule set that results 
from applying the transformation to all axioms in K, then Px is logically 
equivalent to K w.r.t. the semantics of FOL Px has the same set of models 
and entailed conclusions as K. 


Hence, reasoning in Horn-DL can be reduced to reasoning within logic pro- 
gramming (see Chapter 5). 


4.4.5 Reasoning Complexity Summary 
We resume in Table 4.8 here the complexity of various reasoning task for 


the OWL family. 


TABLE 4.8: The complexity of various reasoning task for the OWL family. 


Language | Taxonomic Complexity | Data Complexity | Combined Complexity 
OWL 2 2NExpTime-complete Decidable 2NExpTime-complete 
OWL 2 EL PTime-complete PTime-complete PTime-complete 
OWL 2 QL | NLogSpace-complete ACO NLogSpace-complete 
OWL 2 RL P'Time-complete P'Time-complete PTime-complete 
OWL DL 2NExpTime-complete Decidable NExpTime-complete 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


Chapter 5 


Rule Languages 


5.1 Introduction 


The Rule Interchange Format (RIF) [365] is the third building block of 
the infrastructure for the Semantic Web, along with (principally) SPARQL, 
RDF, and OWL. Although originally envisioned by many as a rules layer for 
the Semantic Web, in reality the design of RIF is based on the observation 
that there are many rules languages in existence, and what is needed is to 
exchange rules between them. That is, RIF focuses on exchange rather than 
trying to develop a single one-fits-all rule language because, in contrast to 
other Semantic Web standards, such as RDF, OWL, and SPARQL, it was 
immediately clear that a single language would not satisfy the needs of many 
popular paradigms for using rules in knowledge representation and business 
modelling (see also the case of RuleML [63]). 

A rule is perhaps one of the simplest notions in computer science, specifi- 
cally FOL: it is an IF - THEN construct, i.e., a Horn formula (named after the 
logician Alfred Horn). If some condition (the IF part) that is checkable in some 
dataset holds, then the conclusion (the THEN part) is processed. Deriving 
somewhat from its roots in Logic, rule systems use a notion of predicates that 
hold or not of some data object or objects. For example, the fact that two peo- 
ple are married might be represented with predicates as Married(lisa, john). 
Married is a predicate that can be said to hold between individuals lisa and 
john. Adding the notion of variables, a rule could be something like: 


if Married(x, y) then Loves(x, y) . 


We would expect that for every pair of (x,y) (e.g., lisa and john) for which 
the Married predicate holds, using this rule we would conclude that the Loves 
predicate holds for that pair as well. 

As pointed out in Section 2.3, RIF includes three dialects, a Core dialect 
that is extended into a Basic Logic Dialect (BLD) and Production Rule Dialect 
(PRD). 


RIF-Core RIF-Core corresponds to Horn logic without function symbols 
(î.e., Datalog [449]) with a number of extensions to support features 
such as objects and frames as in F-logic [238]. RIF-Core is a subset of 
RIF-BLD and RIF-PRD. 
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RIF-BLD From a theoretical perspective, RIF-BLD corresponds to the lan- 
guage of definite Horn rules with equality and a standard first-order 
semantics [93]. RIF-Core is a a specialization of RIF-BLD. 


RIF-PRD It aims at capturing the main aspects of various production rule 
systems. Production rules have an if part, or condition, and a then part, 
or action. The condition is like the condition part of logic rules (as cov- 
ered by RIF-Core and its basic logic dialect extension, RIF-BLD). The 
then part contains actions. An action can assert facts, modify facts, re- 
tract facts, and have other side effects. In general, an action is different 
from the conclusion of a logic rule, which contains only a logical state- 
ment. However, the conclusion of rules interchanged using RIF-Core can 
be interpreted, according to RIF-PRD operational semantics, as actions 
that assert facts in the knowledge base. 


As we did for RDFS and OWL 2, where we were looking at the logical founda- 
tions of these languages, we present here the logical foundations behind RIF, 
too. Specifically, we present here Datalog [2, 449] with concrete datatypes 
(compare with OWL 2 concrete domains in Section 4.2.2.2), which is at the 
heart of RIF-Core, the common ingredient between RIF-BLD and RIF-PRD, 
and is sufficient to illustrate the main characteristics of the RIF family. 

Appendix D describes RIF-Core and the mapping of a significant sub- 
language of RIF-Core to Datalog. 


5.2 Datalog Basics 


Datalog is a rule and query language for deductive databases that syntac- 
tically is a subset of Logic Programs [263] and became prominent as a separate 
area around 1977 when Hervé Gallaire and Jack Minker organized a workshop 
on logic and databases [166]. David Maier is credited with coining the term 
Datalog [2]. 

A knowledge base K = (F,P) consists of a finite facts component F and a 
finite LP component P, which are both defined below. 


Facts Component. F is a finite set of atoms of the form 


Peira Cr) 5 
where p is an n-ary relation and every c; is an individual (also called constant). 
For each p, we represent the facts p(c1,..., Cn) in F by means of a relational n- 
ary table Tp, containing the records (c1,...,Cn). We assume that there cannot 


be two records representing the same tuple (if there are, then we remove one 
of them). 
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Example 16 ([358]). Suppose we have a car-selling site, and we would like 
to buy a car. The facts are the cars, which belong to the relation CarTable 
shown below. 


CarTable 
ID MODEL TYPE PRICE KM COLOR AIR AIR ENG DISCOUNT 
BAG COND FUEL 
455 MAZDA 3 Sedan 12500 18000 Red 0 1 Gasoline 0.1 
34 ALFA 156 Sedan 12000 17000 Black EA 0 Diesel 0.2 
1812 FORD FOCUS Estate 13000 16000 Gray șI 1 Gasoline 0.2 


Rule Component. P is a finite set of rules, which are similar as conjunctive 
RDFS and DL queries (See Sections 3.3, 4.3 and 6), and are of the form 


p(x) — Jy.p(x, y) , (5.1) 


where now p(x, y) is a conjunction! of n-ary predicates p;(z;) and z; is a vector 
of distinguished or non-distinguished variables. Specifically, we say that p(x) 
is the head and Jy.p(x, y) is the body of the rule, x is a vector of variables 
occurring in the body, called the distinguished variables, y are so-called non- 
distinguished variables and are distinct from the variables in x, each variable 
occurring in p; is either a distinguished or a non-distinguished variable. If 
clear from the context, we may omit the existential quantification Jy. We 
also assume that predicate names in a rule body are distinct, t.e., p; 4 pj for 
i Æ j. The intended meaning of a rule such as (5.1) is that the head p(x) is 
true whenever the body y.p(x, y) is true. 

We also assume that relations occurring in F do not occur in the head 
of rules (so, we do not allow that the fact relations occurring in F can be 
redefined by P). As usual in deductive databases, the relations in F are called 
extensional relations, while the others are intensional relations. 


Example 17. Consider again Example 16. An excerpt of the domain ontology 
is described below and partially encodes the web directory behind the car selling 
site www. autos. com. 


Vehicles(x)  Cars(x) Cars(x) + Estate(x) 

Vehicles(x) + Trucks(x) PassengerCars(1) + MidSizeCars(x) 
Vehicles(x) + Vans(x) PassengerCars(1) + SportyCars(x) 
Cars(x) + LuxuryCars(x) PassengerCars(1) + CompactCars(1) 
Cars(z) + PassengerCars(x) hasPrice(z, x4) + CarTable(x1,...,t9) 
Cars(z1) + CarTable(x1,...,29) hasKm(11, 25) + CarTable(x1,...,29) 
Cars(x) + Sedan(x) FuelType(11,19) + CarTable(x1,...,x9) 


Remark 10. Note that we impose relations p; in a rule body to be distinct. 
This is not a limitation as we may rewrite, e.g., 


p(x) — q(x, y), a(y, 2) 


as 


1 Again we use the symbol “,” to denote conjunction in the rule body. 
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p(x) — ale, y), p' (y, 2) 
p'(y, z) = aly, 2) 
for a new relation symbol p'. 


We may write a fact p(c) also as a rule of the form 
pc) 1, (5.2) 
where 1 in the body is a ground atom, which is always interpreted as true. 


Semantics. From a semantics point of view, given K = (F,P), the notions of 
Herbrand universe Hy (the set of all constants occurring in K) and Herbrand 
base Bx of K (the set of all ground atoms that can be formed using constants 
in Hx and atoms occurring in K) are as usual. 

Additionally, given K, the set of ground rules K* derived from the ground- 
ing of K is constructed as follows: 


1. set K* to be {p(c) — 1 | p(c) € F}; 
2. add to K* the set of all ground instantiations of rules in P. 


An interpretation T for K is a subset of Be. Given K, we say that T satisfies 
(is a model of) 


e the ground atom 1, denoted Z = 1; 


e a ground atom A € Bx, denoted T = A, iff A € T; 


e a ground rule body ¢ of a rule A + ge K*, denoted T = 4, iff Z is a 
model of all atoms in @; 


e a ground rule r € K*, denoted T f r, iff Z is a model of the head of r 
whenever Z is a model of the body of r; 


e K*, denoted Z |= K*, iff Z satisfies all rules r € K*; 
e KÇ, denoted T — K, if Z is a model of K*. 


Let Ik = 28 be the set of all interpretations (there are 2!?*! many). Now, 
for Zu, Do € Ix , we write Di < Ig iff Zi C Za. It is easy to see that (Ix, <) is 
a finite complete lattice. 

Among all the models, one model plays a special role: namely the <-least 
model Mx of K. The existence, finiteness, and uniqueness of the minimal 
model Mx is guaranteed to exist by the following argument (see, e.g., [263]). 

Consider the function Tx: Ik — Ik defined as 


Tx(Z) :={A| Ae de K* st. TE Q}. (5.3) 


Then, it can be shown that Tx is monotone on Ix, i.e., if Zi < Z then 
Tx (Zi) < Tc (Z2). By the well-known Tarski-Knaster fixed-point theorem [439] 
for monotone functions over complete lattices, we get immediately: 
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Proposition 16. For a knowledge base K, there exists a unique minimal 
model Mx that is the least fixed point of the function Tk. Mx can be obtained 


as the limit of the <-monotone sequence, Zp,...,Z;,..., where 
To = F 
5.4 
Liy = Tkt). (5.4) 


The minimal model is attained in at most 218| Tk iterations. 


5.3 Concrete Domains 


Similarly to DLs (see Section 4.2.2.2), we may extend Datalog with con- 
crete domains, to deal with datatypes, such as strings and integers. The for- 
malization is similar to DLs, 1.e., we assume a set of constants that are data 
values and a set of datatype predicates, where each datatype predicate has 
a predefined arity n > 1. A datatype theory D= (AP, .P) consists of a finite 
datatype domain AP and a mapping -P that assigns to each data value an 
element of AP and to each datatype predicate p of arity n a relation over AP 
of arity n. 

A concrete atom is an expression of the form d(z), where z is a vector of 
variables or data values. We extend an interpretation Z to concrete atoms by 
saying that Z satisfies (is a model of) a ground concrete atom d(v), denoted 
ZF d(v) iff v e dP. For instance, >20 may be a unary predicate denoting the 
set of integers greater or equal to 20. For convenience, we write “functional 
predicates”? as assignments of the form x :=f (z) and assume that the function 
f(z) is safe. 

Rules are then extended by allowing concrete atoms to occur in a rule body 
with the condition that the concrete atom is safe, i.e., a variable occurring in 
a concrete atom does also occur in non-concrete atoms, of the same rule. For 
instance, the 


GoodHotel(x,r,p) += Hotel(x), HasRoomPrice(x,r,p), Sao(p) (5.5) 


has intended meaning to define a good hotel as one having a room with a 
price not larger than 80€. 

Next, let us extend both the Herbrand universe as H, 2 := He U AP and 
the Herbrand base BP to the set of all ground atoms and ground concrete 
atoms that can be formed from the constants in the Herbrand universe. 

An interpretation Z is extended by being a subset of BP with the addi- 
tional condition that d(v) € Z iff Z = d(v). Of course, (2BR , <) is still a finite 
complete lattice (recall that AP is finite). 


2 A predicate p(x, y) is functional if for any v there is unique v’ for which p(v, v’) is true. 
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Now, the notions of satisfaction (is model of) are extended in the obvious 
way to rules including concrete atoms. It is then straightforward to see that 
the analogue of Proposition 16 still applies to interpretations over BP in place 
of Bx. 


5.4 Conjunctive Queries 


A query is a rule 


q(x) — Jy.p(x,y) , (5.6) 


and the answer set of a query q w.r.t. a set K of facts and rules (denoted 
ans(K, q)) is the set of tuples t such that there exists t’ such that the in- 
stantiation y(t, t’) of the query body is satisfied by the minimal model of K, 
which is guaranteed to exist by Proposition 16. 


Example 18. Consider the following rules: 
ri :path(x,y) +  edge(x,y) (5.7) 
ra :path(x,y) << path(z, z), edge(z, y) (5.8) 


The knowledge base K contains the rules above and the extensional database 
of edges as shown in the relational table Teage below: 


It can be verified that the set of answers of predicate path is given by: 


ans(K, path) 


ajaja 
alata 


b 
b 
b 


C 
Cc 
C 


p i a 


For instance, informally (a, a) € ans(K, path) as there is an edge from a to b 
and, thus, there is a path from a to b. Therefore, as there is and edge from b 
to a as well, there is a path froma toa. 

In order to verify the whole, below we report the computation of the mini- 
mal model Mx according to Proposition 16. We report only the restriction of 
T; to the predicate path, denoted as T;(path) = {t | path(t) € Z;}.3 


3The analogous value of Z;(edge) remains constantly equal to the tuples in F. 
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To(path) = Oo 

Ti(path) = F 

T-(path) = Z\(path) U {(c,a), (b,c), (b,b), (a,a)} 
T3(path) = Ta(path) U {(c,c)} 

Talpath) = TZ3(path) . 


So, the least fixed point, i.e., minimal model is attained after four Tp itera- 
tions. 


5.5 Reasoning 


The major reasoning task is to compute the answer set ans(K, q) of a query 
q, given a KB K. 
A simple query answering procedure to determine ans(K, q) is: 


1. Convert K into K*; 

2. Compute the minimal model Mx of K*; 

3. Store the minimal model Mx of K* in a database; 
4. Translate the query q into an SQL statement; 

5. Execute the SQL query over the relational database. 


A bottleneck of this approach is that Mx may be huge, in the worst case 
exponential in the size of K (note that if k = |Hx| and p is an n-ary predi- 
cate then there are n* ground instances of p). To overcome this problem, the 
LP community has developed numerous alternative approaches [2, 263, 449], 
notably SLD-Resolution [147, 263] and memoing techniques (also called 
tabling/tabulation/magic sets) —see, e.g., [95, 461]. 

We present them here as they are two major representatives, and which 
broadly belongs to the family of so-called query driven answering procedures. 


5.5.1 SLD-Resolution Driven Query Answering 


SLD-Resolution (Selective Linear Definite clause resolution) is the basic 
inference rule used in logic programming (see, e.g., [263]). It is a refinement of 
resolution (see e.g., [67, 93, 158, 167, 378]), which is both sound and refutation 
complete for Horn clauses and was introduced first in [147]. 

The basic principle underlying the method is as follows. We first illustrate 
the propositional case and the FOL case. 

The propositional SLD inference rule is as follows: assume we have propo- 
sitional rules 
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From A<A],...,C,..., Az 

and B<Bi,...,Bm 

and B=C 

infer AS Aj,...,By,...,Bm,..., Ak 


(5.9) 


where the inferred rule is obtained from A + Aj,...,C,..., Az by replacing 
atom C with the atoms B,,...,B,,. The propositional atom B is called the 
selected atom. 

In the FOL case the atoms B and C, which may contain variables, need 
not be equal. However, it is required that B and C have a most general unifier 
(mgu) (see,e.g., [263]), that is a substitution @ of the variables in B and C 
such that the application of the substitution to B and C make the two atoms 
equal. 

Specifically, a substitution @ is a finite set of the form 


0 = {a1/ti,.-.,Ln/tn} 5 


where each x; is variable, each t; is either a variable or constant distinct from 
zi, and the variables 2,,...,t, are distinct. Each element 2;/t; is called a 
binding for xi. 0 is called ground if all t; are constants. 6 is called a variable 
substitution iff all t; are variables. 

Given an atom A and a substitution 9 = {x1/ti,...,¢n/tn}, with Ad 
we denote the atom obtained from A by replacing simultaneously all vari- 
ables x; with t;. Given two substitutions 0 = (11/t1,...,tn/tnj and o = 
[y1/51,...,Ym/Smj, then the composition 00 of 0 and o is the substitution 
obtained from the set 


(21/t10, pax ,Zn/tn0, Y1/51, ae „Ym /Sm y 


by deleting the bindings z;/t;0 for which x; = tio and deleting any binding 
y;/8; for which yj € {1,...,2n}. 

The substitution e given by the empty set is called identity substitution. 

Two atoms A and B are variants iff there is a substitution 0 and o such 
that A = BO and B = Ao. 

A variable substitution 0 = [11/y1,...,tn/Yn) is called a renaming of A 
if given that 


e V is the set of variables occurring in A; 
e all x; are in V; 
e all y; are distinct, 


then 
(V \ {a1,---,2n}) NA {y1,---,ynp =O. 


Let S = {Aj,...,An} be a set of atoms A;, we say that a substitution 0 is 
an unifier for S iff S0 = {A10,...,AnO} is a singleton set. An unifier of S is 
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called most general unifier (mgu) for S if, for each unifier o of S there exists 
a non-empty substitution y such that o = 0y. We say that two atoms A and 
B unify iff there is an mgu for (A, B}. We refer the reader to, e.g., [263, 308], 
for unification algorithms. 

We are now ready to illustrate the SLD-resolution rule for Datalog rules: 


From A+ A4,...,0,..., Ax 

and Be Bi,..., Bm 

and 6 asa mgu of {B,C} 

infer A0 + A10,...,B10,...,Bm0..., AkO 


(5.10) 


in which we assume that all atoms in the second rule have been renamed in 
order not to share any variable with the first one. 
We have also a specialized case that involves a fact 


From At Aj,...,C,...,Ap 


a dé A a mgu of {B,C} (5.11) 
infer A0 + Aj0,...,1,..., AkO 
Now, consider a KB K and a query rule 
q(x) — @. 
An SLD-derivation of q w.r.t. K consists of a finite sequence of rules r1,..., Tn, 


each of which has q as head, rı is the query rule, each rule r;,1 is inferred 
from r; via SLD-resolution, and rp is the rule 


a(x)0 = 1 


telling us that indeed q(x)0 is always true. In that case, we say that the 
restriction of the substitution 0 to the variables in x is a computed answer for 
q. With jx we denote the vector (x10,...,tn0). 

It can be shown that 


Proposition 17 (Soundness & Completeness [263]). Given a KB K and a 
query q of the form q(x) + ¢, then 


1. every computed answer is an answer, i.e., if 0 is a computed answer of 
q w.r.t. K, then 0), € ans(K, q); 


2. for every answer c € ans(K,q), there is a computed answer 6 of q 
w.r.t. K such x = c. 


Hence in order to compute the answer set ans(K, q) via SLD-derivation, it suf- 
fices to determine (carefully, to avoid non-termination) all computed answers 
of q w.r.t. K. 
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Example 19. Consider Example 18 and the query 
q(x, y) = path(x,y) . 


The following is an SLD-derivation of q w.r.t. K: 


q(x, y) — path(x, y) 
q(x, y) — path(z,2),edge(z,y) (x,y) = () 
q(x, y) + path(z, 21), edge(21,2),edge(2,y) (x,y) = (—,-) 
q(x, y) + edge(z, 21), edge(21, z), edge(z,y) Bit) = (=>) 
q(c, y) — 1, edge(b, z), edge(z, y) Oy) = (c, —) 
alc,y) — 1, 1, edge(a, y) izy = (6, —) 
q(c,c) E 1,1,1 Gita = (0, Cc) - 


Note that indeed Oy = (c,c) € ans(K, q) is an answer computed by this 
SLD-derivation. 


Note that Example 19 exhibits also a potential non-terminating issue, if we 
apply the SLD-resolution always to the first atom of the query rule body. To 
this end appropriate atom selection strategies are required to avoid such a 
problem. We will not address them here. We instead present an alternative 
method that does not exhibit this issue, as illustrated next. 


5.5.2 Tabling like Query Driven Query Answering 


The algorithm we present here is an improved query driven query evalu- 
ation algorithm based on the Semi Naive Evaluation Algorithm for Datalog 
(see [33, 437, 449]) and may fall under the label of memoing technique (also 
called tabling/tabulation) —see, e.g., [95, 461]. 

To start with, we recall that the Semi-Naive Evaluation Algorithm for Dat- 
alog?* is a well-known query answering algorithm, whose basic idea is quite sim- 
ple and has the advantage to be implemented on top of a relational database 
system to which to submit SQL queries. 

Basically, as we do not want to rely on grounding K, we collect the answers 
to a query incrementally together: roughly the method is as follows: 


1. start by assuming all IDB (intensional database) relations empty; 


2. repeatedly evaluate the rules using the EDB (extensional database) and 
the previous IDB, to get a new IDB; 


3. stop when there is no change to IDB. 


4nttp://infolab.stanford.edu/~ullman/fcdb/aut07/slides/dlog. ppt 
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Of course, since the EDB never changes, on each round we only get new IDB 
tuples if we use at least one IDB tuple that was obtained on the previous 
round. This allows us to save work by avoiding rediscovering most known 
facts. 

To start with, we use the usual relation “directly depends on” among 
relation symbols, i.e., given K, we say that relation symbol p directly depends 
on relation symbol q if there is a rule in K such that p occurs in the head of 
it and q occurs in the body of it. The relation depends on is the transitive 
closure of “directly depends on”. The dependency graph of K is a directed 
graph where nodes are relation symbols and the set of edges is the “directly 
depends on” relation. The KB is recursive if there is a cycle in the dependency 
graph (î.e., there is p depending on p). 

The query answering method we present is called query driven as we re- 
strict the computation to predicates that depend on a query only. 

At first, consider a general rule of the form p(x) + y(x, y). Assume that 
(x,y) depends on the predicates p1,...,Px, which occur in the rule body 
p(x, y). Assume that A,,,..., Ap, are the answers collected so far for the 
predicates p1,..., px. Let us consider a procedure eval(p, Ap,,..., Ap, ), which 
computes the set of answers c of p, by evaluating the body y(x, y) over the 
data provided by A,,,...,Ap,. Of course, if the predicate p is in the head of 
more than one rule, then eval(p, Ap,,..., Ap, ) is the union of all the evalua- 
tions over the rules having p as head. Note also that eval(p, Ap,,..., Ap, ) is 
monotone in its arguments. 

For instance, consider Example 18. Assume that both Agage and Apatn 
are given by Teage. Then eval(path, Acage, Apatn) returns the set of answers 
Apa = Zo (path). Note that eval(path, Acage, Apatn) can be obtained using 
relational algebra as: 


taba,,,. U ™1,4(taba,,,. 2=3 taba (5.12) 


edge edge path ) $ 


In substance eval revises the set of answers for path. Note that with re- 
spect to Example 18, taba’. does not have the record (c,c). However, we 
obtain the answers after reiterating the evaluation step once more. That is, 
eval(path, Acage; Aath) returns all answers of path. Note also that the union 
in Eq. (5.12) is due to the fact that path is in the head of two rules. 

We are not going to further investigate the implementation details of the 
eval(p, Ap, , ..., Ap, ) procedure, though it has to be carefully written to min- 
imize the number of table look-ups and relational algebraic operations such as 
joins. It can be obtained by means of a combination of SQL statements over 
the tables and the application of the truth combination functions occurring 
in the rule body of p. 

We point out that eval(p, Ap,,..., Ap,) can also be seen as a query to a 
database made out by the relations taba,,,...,taba,, and that any succes- 
sive evaluation step corresponds to the execution of the same query over an 
updated database. We refer the reader to, e.g., [128, 129, 130, 259] concerning 
the problem of repeatedly evaluating the same query to a database that is 
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being updated between successive query requests. In this situation, it may 
be possible to use the difference between successive database states and the 
answer to the query in one state to reduce the cost of evaluating the query in 
the next state. 

We describe now our query answering procedure (see Algorithm 2). Assume 
we are interested in determining all answers of g(x). We start with putting the 
predicate symbol q in the active set of predicate symbols A. At each iteration 
step we select a new predicate p from A and evaluate it using the eval function 
with respect to the answers gathered so far. If the evaluation leads to a better 
answer set for p (A, > v(p)), we update the current answer set v(p) and add 
all predicates p”, whose rule body contains p (the parents of p), to A, i.e., all 
predicate symbols that might depend on p are put in the active set to be 
examined. At some point (even if cyclic definitions are present) A will become 
empty, as the truth space is finite, and we have actually found all answers of 
q(x). 

Algorithm 2 uses some auxiliary functions, data structures, and order def- 
inition: 


e for two sets of tuples A, and Ag, we define 


A> ^ iff A» CA ; (5.13) 


As eval(p;, v(pi,), ..., v(Pi, )) is monotone, an optimized version of 
eval(p;, v(Pi,), ..., V(Pi,, )) may return only the newly retrieved tu- 
ples w.r.t. v(p;). In that case, steps 10 and 11 have to be replaced with 


if Ap, # Í then v(p;) := v(p;) U A»,, A := AU (p(p;) N dg) 


e for predicate symbol p;, s(p;) is the set of predicate symbols occurring 
in the rule body of a rule having p; as head, i.e., the sons of pi; 


e for predicate symbol p;, p(p,) = {p; : pi € s(p;)), i.e., the parents of p;; 


e in step 9, pi,,---,Pi,, are all predicate symbols occurring in the rule 
bodies having p; in its head, i.e., the sons s(p;) = [p;,,...,Di, } Of Pi; 


e the variable dg collects the predicate symbols that may influence the 
result of the query predicates; 


e the array variable exp traces the rule bodies that have been “expanded” 
(the predicate symbols occurring in the rule bodies are put into the 
active list); 


e the variable in keeps track of the predicate symbols that have been put 
into the active list so far due to an expansion (to avoid, to put the same 
predicate symbol multiple times in the active list due to rule bodies 
expansion). 
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Algorithm 2 Answer(K, q) 


Input: Knowledge base K, query q. 
Output: A mapping v such that it contains all answers of q. 
1: A:= {q}, dg := {q}, in := 90. 

2: for all predicate symbols p in P do 

3:  v(p)=0, exp(p) = false 

4: while A Z () do 

5: select p; € A, A := A {pi}, de := dg U s(p;) 

6: if (p; extensional predicate) A (v(p;) = 0) then 

7: v(p;) := tabp; 

8 


if p; intensional predicate then 


9: Ap, := eval (pi, v(pi ), = V(Pix, )) 

10: if Ap, > v(p;) then 

11: v(pi) := Ap, , A:= AU (p(p;) N dg) 

12: if not exp(p;) then 

13: exp(p;) = true, A:=AU (s(p;) \ in), in := inUs(p;) 


14: return v 


From a computational complexity point of view, we recall that there are two 
main kinds of complexity connected to plain Datalog [452]: 


e the data complexity is the complexity of checking whether (F,P) H A 
when the set of rules P is fixed, whereas input databases F and ground 
atom A are an input; 


e the combined complexity is the complexity of checking whether (F, P} E 
A when input database F, the set of rules P, and ground atoms A are 
an input. 


Grounding P on an input database F yields polynomially many clauses in the 
size of F; hence, the complexity of propositional logic programming, which is 
linear (see, e.g., [113, 132, 452]), is an upper bound for the data complexity. 
We thus get 


Proposition 18 ([113, 453]). Deciding whether t € ans(K, q) is P-complete 
in data complexity. The program complezity is exponentially higher: deciding 
whether t € ans(K, q) is Exp Time-complete in combined complezity. 


Example 20. Consider Example 18. Let us compute all answers of predicate 
path. So, let q = path. The execution of Answer(K, q) is shown in Table 5.1, 
which also reports Ap, and v(p;) at each iteration i. 
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TABLE 5.1: Query driven computation related to Example 20. 

1. A:= {path}, p; := path, A := Í, dg := (path, edge}, Apatn := 0 
exp(path) := true, A := (path, edge), in := (path, edge} 

2. py := path, A:= = Jodee}, Apatn := 0 

3. pi := edge, A := (), v(edge) = 0, v(edge) := tabeage, A := {path}, exp(edge) := true 

4. py := path, A := Ú, Apatn > v(path), v(path) := Apatn, A := {path} 

5. py := path, A := 0, Apatn > v(path), v(path) := Apatn, A := {path} 

6. py = path, A := Ú, Apatn > v(path), v(path) := Apatn, A := {path} 

7. pi = path, A := 0, Aparn = v(path) 

8. stop. return v(path) 
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Chapter 6 


Query Languages for SWL-based 
Knowledge Bases 


6.1 Introduction 


SPARQL [380, 381] is a query language and a protocol for data that is 
stored natively as RDF or viewed as RDF via middleware. The main mech- 
anism for computing query results in SPARQL is subgraph matching: RDF 
triples in both the queried RDF data and the query pattern are interpreted as 
nodes and edges of directed graphs, and the resulting query graph is matched 
to the data graph using variables as wild cards. Various W3C standards, in- 
cluding RDF and OWL, provide semantic interpretations for RDF graphs that 
allow additional RDF statements to be inferred from explicitly given asser- 
tions. Many applications that rely on these semantics require a query language 
such as SPARQL, but in order to use SPARQL, basic graph pattern match- 
ing has to be defined using semantic entailment relations instead of explicitly 
given graph structures. There are different possible ways of defining a basic 
graph pattern matching extension for an entailment relation. [379] specifies 
one such way for a range of standard Semantic Web entailment relations. Such 
extensions of the SPARQL semantics are called entailment regimes. An en- 
tailment regime defines not only which entailment relation is used, but also 
which queries and graphs are well-formed for the regime, how the entailment 
is used (since there are potentially different meaningful ways to use the same 
entailment relation), or what kinds of errors can arise. 

We are not going into the details of SPARQL specification, as we did not 
go into the details of RDFS, OWL 2, and RIF specification, but rather look 
at it from a logical perspective. 

Specifically, we first introduce the notions of conjunctive / disjunctive 
query for KBs based on RDFS, Description Logics, and Logic Programs. Con- 
junctive queries have already been introduced in the previous chapters, though 
here we give a more involved version, which will be useful later on in this book. 
We then provide the formal definition of SPARQL 1.1. 
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6.2 Conjunctive and Disjunctive Queries 


The RDES case. We assume that a RDFS graph G is ground, that is blank 
nodes have been skolemized, 2.e., replaced with terms in UL. We recap here 
and extend the definitions of Section 3.3. So, an RDFS query is of the rule-like 
form 


q(x) — 3y.p(x, y) 


where we additionally allow built-in triples of the form (s,p,o), where p is 
a built-in predicate taken from a reserved vocabulary and having a fixed in- 
terpretation. We generalize the built-ins to any n-ary predicate p, where p's 
arguments may be pdf variables, values from UL, and p has a fixed interpre- 
tation. We will assume that the evaluation of the predicate can be decided in 
finite time. For convenience, we write “functional predicates”! as assignments 
of the form x:=f(z) and assume that the function f(z) is safe. We also as- 
sume that a non-functional built-in predicate p(z) should be safe as well. For 
instance, 


q(«1, 22) — (x, worksFor, google), (x, hasSalary, s), (s, <, 23000) 


is an RDFS query asking for Google employees earning less than 23000. Here 
< is a built-in predicate. 
The notions of answer and answer set of q w.r.t. G, 1.e., 


ans(G,q) = {t | G EF a(t)} , 


are as from Section 3.3. 

As next, we extend the query language by allowing so-called aggregates to 
occur in a query. Essentially, aggregates may be like the usual SQL aggregate 
functions such as SUM, AVG, MAX, MIN. 

For instance, suppose we are looking for employees that work for some 
company. We would like to know the average salary of their employment. 
Such a query may be expressed as 


q(a,avgS) +=  (x,worksFor, y), (x, hasSalary, s), 
GroupedBy(x), 
avgS :=AVG[s] . 


Essentially, we group by the employee, consider for each employee the 
salaries, and compute the average salary value for each group. That is, if 
g = {(t,t1),..., (t,tn)} is a group of tuples with the same value t for employee 
x, and value t; for s, then the value of avgL for the group g is (X; t;)/n. 


1A predicate p(x, y) is functional if for any t there is unique t' for which p(t, t’) is true. 
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Formally, let @ be an aggregate function with 
Q € (SUM, AVG, MAX, MIN, COUNT} 


then a query with aggregates is of the form 


q(x,a) + Jy.p(x,y), 
GroupedBy(w), (6.1) 
a:=0[f(z)) 


where w are variables in x or y, each variable in x occurs in w and any 
variable in z occurs in y. 

From a semantics point of view, we say that Z is a model of (satisfies) 
q(t, a), denoted T E q(t, a) if and only if 


a=Qla¡,...,ax] where g = {(t,t1),..., (t, te) }, 
is a group of k tuples with identical projection 
on the variables in w, y(t, t,) is true in Z 
and a, = f(t) where t, is the projection of tr 
on the variables z . 


Now, the notion of G |= q(t, a) is as usual, i.e., any model of G is a model of 
q(t, a). 

Eventually, we further allow to order answers according to some ordering 
functions. For instance, assume that additionally would like to order the em- 
ployee according to the average salary of employment. Then such a query will 
be expressed as 


q(z,avgS) +=  (x,worksFor, y), (x, hasSalary, s), 
GroupedBy(x), 
avgS :=AVG[s], 
OrderBy(avgS) . 


Formally, a query with ordering is of the form 


q(x,2) +  3Jy«p(x, y), OrderBy(z) 


or, in case grouping is allowed as well, it is of the form 


q(x,2,0) += Jy.y(x,y), 
GroupedBy(w), 
a:=0[f(z)l, 
OrderBy(2) . 


(6.2) 


From a semantics point of view, the notion of G |= q(t, z, a) is as before, but 
the notion of answer set has to be enforced with the fact that the answers 
are now ordered according to the assignment to the variable z. Of course, we 
require that the set of values over which z ranges can be ordered (like string, 
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integers, reals). Finally, note that the additional of the SQL-like statement 
LIMIT(&) can be added straightforwardly. 

It is obvious that we may devise the same query answering method as seen 
in Section 3.4, by computing the closure, store it into a database, and then 
using SQL queries with the appropriate support of built-in predicates. 


Proposition 19. Given a ground graph G, t is an answer to q if and only if 
there exists an instantiation p(t,t') that is true in the closure of G (i.e., all 
triples in p(t,t') are in cl(G)). The answer set can be computed in LogSpace 
w.r.t. the size of cl(G). 


We conclude by defining a disjunctive query q as usual as a finite set of 
conjunctive queries in which all the rules have the same head. Intuitively, the 
answers to a disjunctive query are the union of the answers of the conjunctive 
queries. That is, for a disjunctive query q = [q1,...,4m), G entails q(t) 
(denoted G = q(t)) iff G H q(t) for some q; € q. 

The notion of answer and answer set of a disjunctive query is a straightfor- 
ward extension of the ones for conjunctive queries. Also, it is straightforward 
to see that the answer set of a disjunctive query can be computed similarly 
as for conjunctive queries. 


The OWL 2 and RIF cases. It is pretty obvious that the notion introduced 
for conjunctive RDFS queries straightforwardly extends to the cases of OWL 
2 and RIF. Essentially in the query body we have to replace triple expressions 
with FOL predicates, which are unary or binary in case of OWL 2 (that is in 
the DLs case), and n-ary in case of RIF (that is in the LPs case). 


6.3 SPARQL 


Our introduced query language so far allows for conjunctive and disjunctive 
queries with aggregates, which may be seen as the logical counterpart of SQL. 
In this section we will present now SPARQL, which has some specific features 
not covered so far. 

We recall that SPARQL [380, 381] is the W3C recommended query lan- 
guage for RDF. 

A SPARQL query is defined by a triple Q = (P, G, V), where P is a graph 
pattern (defined below) and the dataset G is an RDF graph and V is the 
result form. We will restrict ourselves to SELECT queries in this work so it is 
sufficient to consider the result form V as a list of variables. 


Remark 11. Note that, for presentation purposes, we simplify the notion of 
datasets by excluding named graphs and thus GRAPH queries. Our definitions 
can be straightforwardly extended to named graphs and we refer the reader to 
the SPARQL W3C specification [381] for details. 
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We base our semantics of SPARQL on the semantics presented by Pérez 
et al. [348], extending the multiset semantics to lists, which are considered a 
multiset with “default” ordering. 

RDF triples, possibly with variables in subject, predicate, or object posi- 
tions, are called triple patterns. In the basic case, a graph pattern P is a set 
of triple patterns, also called basic graph pattern (BGP). 

Let U, B, L be defined as for RDFS (Chapter 3) and let V denote a set of 
variables, disjoint from UBL. We further denote by var(P) the set of variables 
present in a graph pattern P. 

Given a graph G and a BGP P, a solution [381, Section 12.3.1] 0 for P over 
G is a mapping over a subset V of var(P), i.e., 0 : V — term(G) such that 
G E PO where PO represents the triples obtained by replacing the variables 
in graph pattern P according to 0, and where G + PO means that any triple 
in PO is entailed by G. We call V the domain of 0, denoted by dom(0). For 
convenience, sometimes we will use the notation 6 = (11/t1,...,tn/tnj to 
indicate that 0(x,) = ti, i.e., variable x; is assigned to term t;. 

Two mappings 6; and 62 are considered compatible if for all x € dom(8.)N 
dom(02),01(1) = 02(). We call the evaluation of a BGP P over a graph G, 
denoted [Pl], the set of solutions. 


Remark 12. Note that variables in the domain of 9 play the role of distin- 
guished variables in conjunctive queries and there are no non-distinguished 
variables. 


The notion of solution for BGPs is the same as the notion of answers for 
conjunctive queries: 


Proposition 20. Given a graph G and a BGP P, then the solutions of P are 
the same as the answers of the query q(var(P)) — P (where var(P) is the 
vector of variables in P), i.e., ans(G, q) = [Ple. 


We present the syntax of SPARQL based on [348] and present graph patterns 
similarly. A triple pattern (s,p,o) is a graph pattern where s,o € ULV and 
p € UV? Sets of triple patterns are called Basic Graph Patterns (BGPs). A 
generic graph pattern is defined in a recursive manner: any BGP is a graph 
pattern; if P and P’ are graph patterns, R is a filter expression (see [381]), 
then (P AND P”), (P OPTIONAL P”), (P UNION P”), (P FILTER R) are 
graph patterns. As noted in Remark 11 we do not consider GRAPH patterns. 

Evaluations of more complex patterns including FILTERs, OPTIONAL 
patterns, AND patterns, UNION patterns, etc., are defined by an algebra 
that is built on top of this basic graph pattern matching (see [348, 381]). 

The SPARQL relational algebra is defined as follows: Let O, and Qz be 
sets of mappings: then 


2We do not consider blank nodes in triple patterns since they can be considered as 
variables. 
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Qı Dx OQ» = 10, U 02 | 01 € Qı, 0, € Qo, 0, and 02 compatible) 

O UO» = {0 | 0 E€ Qı or9 € Ma) 

Q,-Q2 = (01 E Qi | for all 92 € Oo, 0, and @ not compatible) 
Qı XI oa = (Q1 X Q2) U (Q1 — 9) . 


Now, let 7 = (s,p,o) be a triple pattern, P, P,, Pa graph patterns and G an 
RDF graph, then the evaluation [348, Definition 2.2] [-]¢ is recursively defined 
as follows: 


= 10| dom(0) = var(P) and G = 70) 


H FILTER Pola = [Pile bX [Pala 

IP, AND Pola = [Pile Dad [Pale 

[P, UNION Pala = [Pile Y [Pole 

[Pi OPTIONAL Pola = [Pile bX [Pala 

[P FILTER Ra = (0€ [Ple | RO is true}. 


Let R be a FILTER? expression, u,v € VU UBL. The valuation of R on a 
substitution 0, written RO, is true if: 


1. R= BOUND(v) with v € dom(0); 

2. R= isBLANK(v) with v € dom(9) and 6(v) € B 

. R=islRi(v) with v € dom(0) and 0(v) € U; 

. R= isLITERAL(v) with v € dom(9) and 8(w) € L; 

u =v) with u,v € dom(0) UUBLA 0(u) = 0(v); 
=R,) with R49 is false; 


II 


RV Re) with R498 is true or R30 is true; 


o a agn p g 


R=( 
R=(>R 
R=( 
R = (Ri A Ro) with R10 is true and R20 is true. 
RO yields an error (denoted e), if: 


1. R= isBLANK(v), R = isIRI(v), or R = isLITERAL(v) and v ¢ dom(0) U 


L; 
2. R= (u = v) with u g dom(9) UT or v ¢ dom(0) UT; 
3. R= (=Rı) and R10 =€; 
4. R= (RV Ro) and (ROZ T and R20 A T) and (R10 = £ or Ro = €); 
5. R= (R1 ^A R2) and R4108 = € or R390 =e. 


3For simplicity, we will omit from the presentation FILTERs such as comparison operators 
(‘<’, >>), data type conversion and string functions and refer the reader to [381, 
Section 11.3] for details. 


Query Languages for SWL-based Knowledge Bases 93 


Otherwise RO is false. 

In order to make the presented semantics compliant with the SPARQL 
specification [381], we need to introduce an extension to consider unsafe 
FILTERs (also presented in [10]). 

So, let P,, P> be graph patterns R a FILTER expression. A mapping @ is 
in [P, OPTIONAL (P, FILTER R)] pg if and only if: 


1. 0 = 01 U b2, s.t. 01 € [Pi], 02 € [Pala are compatible and RO is true, 
or 


2. 0€ [Pila and vo € [P2]a, 6 and 62 are not compatible, or 


3. 0 € [Pila and vâ € [Pala s.t. 6 and 62 are compatible, and R63 is false 
for 03 =0U 02. 


We next extend our definitions to include variable assignments, aggregates, 
and solution modifiers. These are extensions similar to the ones presented re- 
lated to assignments and aggregations for conjunctive queries (see Section 6.2). 

So, let P be a graph pattern and G a graph, the evaluation of an ASSIGN 
statement is defined as: 


[P ASSIGN f(z) AS za = {0 | 61 € [Pla, 9 = 01 [2/£(01(2)))) 


where 
_ f§ OU {z/t} if z ¢ dom(0) 
0[2/t] = { (0 \ {z/t'}) U{z/t} otherwise . 


Essentially, we assign to the variable z the value f(@1(z)), which is the evalu- 
ation of the function f(z) with respect to a substitution 01 € [P]c. 

For instance, using a built-in function we can retrieve for each employee 
the length of employment for any company: 


SELECT ?x ?y ?z WHERE { 
(?x worksFor ?y), (?x workingPeriod 71) 
ASSIGN length(?1) AS ?z 

F 


Here, the length built-in predicate returns, given a temporal expression, 
e.g., encoded in some way, the overall total length of the intervals. 


Remark 13. Note that this definition is more general than “SELECT expr AS 
?var” project expressions in current SPARQL 1.1 [380] due to not requiring 
that the assigned variable be unbound. 


We introduce the ORDERBY clause where the evaluation of a [P ORDERBY ?z]a 
statement is defined as 


[P ORDERBY ?zla = [Plot 
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where 17, means that the ordering of the solutions — for any 9 € [Pla — ac- 
cording to the values of 0(?x). The ordering follows the rules in [381, Section 
9.1]. Likewise, the SQL-like statement LIMIT(k) can be added straightfor- 
wardly. Of course, similarly to ordering in the conjunctive query answering 
setting, we require that the set of values over which x ranges can be ordered 
and some linearization method for possible posets may be applied if necessary, 
such as [248]. 

We can further extend the evaluation of SPARQL queries with aggregate 
functions 

Q e (SUM, AVG, MAX, MIN, COUNT) 


as follows: the evaluation of a GROUPBY statement is defined as:* 
[P GROUPBY(w) Qf(z) AS ala = {6 | 01 in [Ple, 
0 = A; | wlai/@; fi (A:(Zi))] }oistinct 


where the variables a; ¢ var(P), z; € var(P) and none of the GROUPBY vari- 
ables w are included in the aggregation function variables z;. Here, we denote 
by |w the restriction of variables in 8 to variables in w. Using this notation, we 
can also straightforwardly introduce projection, i.e., sub-SELECTs as an alge- 
braic operator in the language covering another new feature of SPARQL 1.1: 
namely 


[SELECT V (Pia = (0101 in [Ple,0=011w) . 


Remark 14. Please note that the aggregator functions have a domain of 
definition and thus can only be applied to values of their respective domain. For 
example, SUM and AVG can only be used on numeric values, while MAX, MIN 
are applicable to any total order. Resolution of type mismatches for aggregates 
is being defined in SPARQL 1.1 [380]. The COUNT aggregator can be used for 
any finite set of values. 


Remark 15. Please note that, unlike the current SPARQL 1.1 syntax, assign- 
ment, solution modifiers (ORDER BY, LIMIT), and aggregation are stand- 
alone operators in our language and do not need to be tied to a sub-SELECT 
but can occur nested within any pattern. This may be viewed as syntactic sugar 
allowing for more concise writing than the current SPARQL 1.1 [380] draft. 


Example 21. Suppose we want to know, for each employee, the average salary 
of their employments with different employers. Then such a query will be ex- 
pressed as: 


SELECT ?x ?avgS WHERE 4 
(?x worksFor ?y) (?x hasSalary ?s) 
GROUPBY (?x) 
AVG(?s) AS ?avgS 


“In the expression, @f(z) AS a is a concise representation of n aggregations of the form 
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Essentially, we group by the employee, consider for each employee its 
salaries and compute the average value for each group. That is, if g = 
[(t,t1),...,(t,tn)) is a group of tuples with the same value t for employee 
x, and value t; for s, then the value of avgS for the group g is O ,t;)/n. 


Proposition 21. Assuming the built-in predicates are computable in finite 
time, the answer set of any SPARQL is finite and can also be computed in 
finite time. 


This proposition can be demonstrated by induction over all the constructs we 
allow in SPARQL. 
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Chapter 7 


Introduction 


There has been a long-lasting misunderstanding in the literature of artifi- 
cial intelligence and uncertainty modelling, regarding the role of probabil- 
ity /possibility theory and vague/fuzzy theory. A clarifying paper is [143]. We 
will recall here salient notes, which may clarify the role of these theories for 
the inexpert reader and will move on to incorporate fuzziness within SWLs in 
the following chapters. 

A standard example that points out the difference between degrees of 
uncertainty and degrees of truth is that of a bottle [143]. In terms of bi- 
nary truth values, a bottle is viewed as full or empty. But if one accounts 
for the quantity of liquid in the bottle, one may, e.g., say that the bottle is 
“almost-full.” Under this way of speaking, “almost full” becomes a fuzzy pred- 
icate [472] and the degree of truth of “the bottle is almost full” reflects the 
amount of liquid in the bottle. The situation is quite different when express- 
ing our ignorance about whether the bottle is either full or not full (we know 
that only one of the two situations is the true one). Saying that the proba- 
bility that the bottle is full is 0.8 does not mean that the bottle is almost 
full. 

We recall that under uncertainty theory fall all those approaches in which 
statements rather than being either true or false, are true or false to some 
probability or possibility (for example, “it will rain tomorrow”). That is, a 
statement is true or false in any world, but we are “uncertain” about which 
world to consider as the right one, and thus we speak about, e.g., a probability 
distribution or a possibility distribution over the worlds. For example, we 
cannot exactly establish whether it will rain tomorrow or not, due to our 
incomplete knowledge about our world, but we can estimate to which degree 
this is probable, possible, or necessary. 

As for the main differences between probability and possibility theory, the 
probability of an event is the sum of the probabilities of all worlds that satisfy 
this event, whereas the possibility of an event is the maximum of the possibili- 
ties of all worlds that satisfy the event. Intuitively, the probability of an event 
aggregates the probabilities of all worlds that satisfy this event, whereas the 
possibility of an event is simply the possibility of the “most optimistic” world 
that satisfies the event. Hence, although both probability and possibility the- 
ory allow for quantifying degrees of uncertainty, they are conceptually quite 
different from each other. That is, probability and possibility theory represent 
different facets of uncertainty. 
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On the other hand, under vagueness/fuzziness theory fall all those ap- 
proaches in which statements (for example, “the tomato is ripe”) are true to 
some degree, which is taken from a truth space. That is, an interpretation 
maps a statement to a truth degree, since we are unable to establish whether 
a statement is completely true or false due to the involvement of vague con- 
cepts, such as “ripe,” which only have an imprecise definition. For example, 
we cannot exactly say whether a tomato is ripe or not, but rather can only 
say that the tomato is ripe to some degree. Usually, such statements involve 
so-called vague/fuzzy predicates [472]. 

Note that all vague/fuzzy statements are truth-functional, that is, the de- 
gree of truth of every statement can be calculated from the degrees of truth of 
its constituents, while uncertain statements cannot be a function of the uncer- 
tainties of their constituents [142]. More concretely, in probability theory, only 
the negation is truth-functional (see Equation. (E.1)), while in possibility the- 
ory, only the disjunction resp. conjunction is truth-functional in possibilities 
resp. necessities of events (see Equation (E.6)). 

Furthermore, fuzzy logics are based on truly many-valued logical operators, 
while uncertainty logics are defined on top of standard binary logical operators. 

We refer the interested reader to Appendix E for a formalization of a simple 
propositional probabilistic or possibilistic logic, which may help the reader to 
verify the differences to fuzziness. 


Chapter 8 


Fuzzy Sets and Mathematical Fuzzy 
Logic Basics 


8.1 Fuzzy Sets Basics 


The aim of this section is to introduce the basic concepts of fuzzy set 
theory. To distinguish between fuzzy sets and classical (nonfuzzy) sets, we 
refer to the latter as crisp sets. For an in-depth treatment we refer the reader 
to, e.g., [139, 242]. 


8.1.1 From Crisp Sets to Fuzzy Sets 


To better highlight the conceptual shift from classical sets to fuzzy sets, we 
start with some basic definitions and well-known properties of classical sets. 
In the following, let us denote with X the universal set containing all possible 
elements of concern in each particular context. The power set, denoted 24, of 
a set A C X, is the set of subsets of A, i.e., 


24 =(B|BCA). 


Often sets are defined by specifying a property satisfied by its members, in 
the form 

A= {z | P(a)}, 
where P(x) is a statement of the form “x has property P” that is either true 
or false for any x € X. 


Example 22. Examples of universe X and subsets A,B € 2% may be 


X = {«| «x is a person} 
A = {a| «x is an employee} 
B = {a| «a is an employee and has salary equal or less than 23000} . 


In the above case we have BC AC X. 


The characteristic function, denoted xa, of a set A C X is a function mapping 
elements of X into (0,1), ie., 


XA: X — {0,1} 
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and is defined as 
1 frreA 


vata) = { 0 forrgA. 


Note that it is easily verified that for any sets A,B € 2*, we have that (the 
reader may verify it also in the case of Example 22) 


AC Biff Vx € X. xalx) < xplz). (8.1) 


Note that (C,2*) is a Boolean algebra, or Boolean lattice, i.e., a complemented 
distributive lattice. 

When clear from context, we denoted with |A| the cardinality of a denu- 
merable set A C X. The complement of a set A is denoted A, i.e., 


A=X\A. 
Of course, | = X and X = () and observe that 
Va € X. xal) =1-xalx). (8.2) 


In a similar way, we may express set operations of intersection and union via 
the characteristic function as follows: 


Va € X. xanglx) = min(xA(2),xB()) (8.3) 
Vz € X. xauéla) = max(xa(z),xsB(x)). (8.4) 


The Cartesian product, A x B, of two sets A, B € 2% is defined as 
Ax B= {(a,b)|a € A,bE B}. 


The most fundamental properties of set operations are illustrated in Table 8.1 
(where A,B,C C X). 

As defined so far, the characteristic function of a crisp set A assigns a value 
of either 1 or 0 to each individual of the universe set and, thus, discriminating 
between being a member or not being a member of A. 

However, in many cases this way to define sets is unsatisfactory, as illus- 
trated below. 


Example 23. For instance, by referring to Example 22, let us try to define 
the set C of employees with a low salary, i.e., 


C = {x € A |x has low salary) . 


The problem relies on the fact that, in order to define such a set precisely, 
we need to define when a salary is low and when it is not, i.e., we need a 
definition making the statement P(x) “x has low salary” either true or false 
for any x. For instance, we may decide that P(x) is true iff the salary is equal 
or less than € 2000, or equivalently, using the characteristic function, define 


G= 1 if salary of x is equal or less than € 2000 
XC) 0 otherwise . 
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TABLE 8.1: Fundamental properties of crisp set operations. 


Involution A = A 
Idempotence AUA = A 
ANA = A 
Commutativity AUB = BUA 
ANB = BNA 
Associativity (AUB)UC = AU(BUC) 
(ANB)NC = AN(BNC) 
Distributivity AN(BUC) = (ANB)U(ANC) 
AU(BNC) = (AUBIN(AUC) 
Absorption AU(ANB) = A 
AN(AUB) = A 
Identity AUX = X 
And = 0 
AUD = A 
ANX = A 
Law of contradiction ANA = 0 
Law of excluded middle AUĀ = X 
De Morgan’s rule AUB = A 
A 


C 
Si îi 


It becomes evident then that the selection of such a threshold may be rather 
subjective and context dependent. It also becomes difficult to conceive that 
an employee having a salary of € 2001 does not have a low salary. 


The main point is that statements involving concepts, such as 
low, medium, high (8.5) 


may not be a matter of true or false, but rather are graded, where the grade 
may be taken from a specified range and indicates the membership degree to 
which elements of the universe belong to the set in question. Larger values 
denote higher degrees of set membership. 

This idea can be formalized [472] by generalizing characteristic functions in 
such a way that the values assigned to the elements of the universe fall within 
a specified range, which is not necessarily {0,1}. Such a function is called 
membership function, the set so defined is called fuzzy set, concepts such as 
those in Equation (8.5) are called fuzzy concepts, and statements involving 
fuzzy concepts are called fuzzy statements. 

The most commonly used range of membership function is [0, 1], though 
other mathematical structures are used as well (see e.g., [186, 241, 242]). 
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To what concerns us here, we will either use [0, 1] or, another typical setting 
based on the finite-valued set (n > 3). 


n—2 


=I}. (8.6) 


1 
Ly = Da eee EI 
w n—1 n= 


For instance, for n = 5, 
Ls = {0,0.25, 0.5, 0.75, 1} . 


There are typically two distinct notations employed in the literature to denote 
the membership function: in one of them the membership function of A is 
denoted u4 and is a function 


pa: X > (0,1), 
while in the other one this function is denoted A and, thus, is a function 
A: X + [0,1]. 


Note that in the second case, the symbol A may have two roles, one to indicate 
the fuzzy set and another one to indicate the fuzzy membership function. We 
will use both notations and the second one only if no ambiguity about the 
role of the symbol A arises. 

With 2* we denote the fuzzy power set over X, i.e., the set of all fuzzy 
sets over X. 


Example 24. By referring to Example 23, we may define the membership 
function uc as, e.g., 


1 if salary of x is equal or less than € 2000 
mola) = 4 (2500 — x)/500 if salary of x is in between € 2000 and € 2500 
0 otherwise , 


which is illustrated below. 


Hc(x) 


0 2000 2500 salary 


So, for instance, an employee having a salary of € 1500 definitely belongs to 
C (the degree is 1), one having a salary of € 2100 belongs to C to degree 0.8, 
while another one having a salary of € 2700 does not belong to C (the degree 
is 0). 
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As pointed out previously, the definition of the membership function depends 
on the context, e.g., the definition of low salary may vary from country to 
country. Moreover, also the shape of such functions may quite be different. 
However, the trapezoidal (Fig. 8.1 (a)), the triangular (Figure 8.1 (b)), the 
L-function (left-shoulder function, Figure 8.1 (c)), and the R-function (right- 
shoulder function, Figure 8.1 (d)) are simple, but most frequently used to 
specify membership degrees. 

The trapezoidal function, trz(a,b, c,d), is defined as follows: let a < b < 
c < d be rational numbers then 


0 if a <a 
(x—a)/(b— a) if x € (a,b] 

trz(a,b,c, da) = 1 if x € (b, c] 
(d—zx)/(d— c) if a € (c,d 
0 if a > d 


0 fx<a 
i (x—a)/(b=a) if x € (a,0] 
rs E (c—-x)/(c—-b) ifxe (b,c 
0 ife@>c. 


Note that tri(a, b,c) = trz(a, b, b,c). The L-function is defined as 


1 ifz <a 
ls(a,b)\(x)= < (b—-x)/(b—a) ifae (a,b) 
0 ifz > b. 
Finally, the R-function is defined as 
0 ifa<a 
rs(a,b)(z) = 4 (x—a)/(b—a) if a € (a, b] 
1 ifz >b. 


So, for instance, in Example 24, we defined the membership function of the 
employees with low salary as 


uc = 1s(2000, 2500) . 


Although fuzzy sets have a far greater expressive power than classical crisp 
sets, its usefulness depends critically on our capability to construct appropriate 
membership functions for various given concepts in different contexts. The 
problem of constructing meaningful membership functions is a difficult one 
and we refer the interested reader to, e.g., [242, Chapter 10]. However, one 
easy and typically satisfactory method to define the membership functions is 
to uniformly partition the range of, e.g., salary values (bounded by a minimum 
and maximum value), into 5 or 7 fuzzy sets using either trapezoidal functions 
(e.g., as illustrated in Figure 8.2), or using triangular functions (as illustrated 
in Figure 8.3). The latter one is the more used one, as it has less parameters. 


106 Foundations of Fuzzy Logic and Semantic Web Languages 


FIGURE 8.1: (a) Trapezoidal function trz(a, b, c, d); (b) Triangular function 
tri(a,b,c); (c) L-function ls(a, b); and (d) R-function rs(a, b). 


Very Low Low Medium High Very High 


max 


salary 


FIGURE 8.2: Fuzzy sets over salaries using trapezoidal functions. 
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Low Medium High High 


0 max 


salary 


FIGURE 8.3: Fuzzy sets over salaries using triangular functions. 


8.1.2 Standard Fuzzy Set Operations 


We have seen that there are three basic operations on crisp sets, namely the 
complement, the intersection, and the union. There are many ways to general- 
ize these operations to fuzzy sets, though, one particular generalization, called 
a standard fuzzy set operation, has been introduced originally in [472] and fol- 
low immediately as a generalization of the characteristic functions of crisp sets 
defined in the previous Section 8.1. In fact, inspired by Equations (8.2)—(8.4), 
for fuzzy sets A, B € 2%, the standard fuzzy set operations are defined for any 
TEX as 


A(z) = 1-A(z) (8.7) 
(AN B)(x) = min(A(x), B(z)) (8.8) 
(AU B)(x) = max(A(zx), B(x)). (8.9) 


In Figures 8.4 — 8.6 we show the graphical interpretation of standard fuzzy 
set operations for fuzzy sets with triangular membership function, where the 
grey part is the result of the operation. 


The standard inclusion among fuzzy sets A,B € 2* is defined directly from 
Equation (8.1) as 
AC Biff Vx € X. A(x) < Bla). (8.10) 


It is interesting to observe that under definition (8.10) of inclusion, in fact the 
inclusion relation is crisp and not fuzzy. We will see later on how one may 
generalise this. Furthermore, note that under standard fuzzy set operations, 
(C, 2*) is a complemented distributive lattice (a Boolean algebra/lattice). It 
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FIGURE 8.4: Graphical view of fuzzy set complement operation. 


FIGURE 8.5: Graphical view of fuzzy set intersection operation. 


FIGURE 8.6: Graphical view of fuzzy set union operation. 


Fuzzy Sets and Mathematical Fuzzy Logic Basics 109 


satisfies all the properties listed in Table 8.1, except the law of contradiction 
and the law of excluded middle. In fact it is easily verified that, e.g., 


min(A(x),1— A(x)) =0 


is violated for at least one x € X. This is easy as it is violated for any 
A(x) € (0,1), while it holds for A(x) € {0,1}. The argument showing that 


max(A(x), 1 — A(x%)) =1 


is violated is the same. 


8.1.3 Norm-Based Fuzzy Set Operations 


In the previous section we introduced the standard fuzzy set operations, 
derived from their crisp analogue, and have seen that they behave exactly as 
the corresponding crisp operations if the range in {0,1} in place of [0, 1]. 


8.1.3.1 T-Norms 


Of course, standard fuzzy set operations are not the only ones that can 
be conceived to be suitable to generalize the classical Boolean operations. 
For each of the three types of operations there is a wide class of plausible 
fuzzy version. The most notable ones are characterized by the so-called class 
of t-norms (called triangular norms), t-conorms (also called s-norm), and 
negation (see, e.g., [241]). 

T-norms are used to define fuzzy set conjunction, t-conorms are used to 
define fuzzy set disjunction, while the negation operator is used to define 
the fuzzy set complement operation. Several t-norms, s-norms, and negation 
functions have been given in the literature. We will denote a t-norm with $, 
an s-norm with O, while will denote a negation with ©. 

An important aspect of such functions is that they satisfy some properties 
that one expects to hold for the connectives as illustrated in Tables 8.2 and 8.3 
(a,b € [0, 1]). 


TABLE 8.2: Axioms for t-norms and s-norms. 


Axiom Name T-norm S-norm 

Boundary condition aQl=a aĝ0 =a 
Commutativity a®b=b@a ab=bga 
Associativity (a@b) @c=a® (b@c) (a@b)@c=a6 (bGc) 


Monotonicity ifb<cthena®b<a®c ifb<cthena®Sb<aGc 
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TABLE 8.3: Axioms for negation functions. 


Axiom Name Negation Function 
Boundary condition 60=1 

e1=0 
Antitonicity if a < b then Oa > ©b 


Note that from the axioms we get also that 


a®0 0 
al = 1 
agb < a 
ab > a. 


Of course, due to commutativity, 8 and $ are monotone also in the first 
argument. 

For any a € [0,1], we say that a negation function © is involutive iff 
©©a =a for any a € [0,1]. Salient negation functions are: 


Standard or Lukasiewicz negation: Oa = 1 — a; 
Gödel negation: Oya is 1 if a = 0, else is 0. 


Of course, Łukasiewicz negation is involutive, while Gödel negation is not. 
Salient t-norm functions are: 


Gödel t-norm: a 8y b = min(a, b); 
Bounded difference or Lukasiewicz t-norm: a &; b = max(0,a +b — 1); 
Algebraic product or product t-norm: a®, b= a: b; 


0 when (a,b) € [0, 1[x[0,1] 
min(a,b) otherwise 


Drastic product: a 8&4 b = { 
Salient s-norm functions are: 

Gödel s-norm: a, b = max(a, b); 

Bounded sum or Lukasiewicz s-norm: a,b = min(1,a+ b); 


Algebraic sum or product s-norm: a @, b= a + b — ab; 


1 when (a,b) €]0, 1]x]0, 1] 


Drastic simata b= { max(a,b) otherwise 
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We say that a t-norm ®, is weaker than a t-norm 82 (denoted 8%, < 82) iff 
for all (a,b) € [0,1]? we have that a 81 b < a 82 b. Similarly, an s-norm Qi 
is weaker than an s-norm 2 (denoted 61 < 62) iff for all (a,b) € [0,1]? we 
have that a 9, b < as b. 

We say that & is idempotent iff a Ya = a for all a € [0,1], and sub- 
idempotent iff aa < a for all a € [0, 1]. Similarly, we say that O is idempotent 
iff a@a = a for all a € [0,1], and sup-idempotent iff ada > a. An idempotent 
element is a value a € [0,1] such that a & a = a. So, 0 and 1 are idempotent 
elements. The numbers 0 and 1, which are idempotent elements for any 9, 
are called trivial. 

The following important properties can be shown about t-norms. 


Proposition 22 (See [241]). 


1. There is the following ordering among t-norms (8 is any t-norm): 
Sa L 8 < Wy 
Sa < 8I S Op < Dg - 
2. The only idempotent t-norm is 8g. 
3. The only t-norm satisfying a a = 0 for all a € [0,1| ts Qa. 
There is also an analogue of Proposition 22 for s-norms. 
Proposition 23 (See [241]). 


1. There is the following ordering among s-norms (® is any s-norm): 


g 


IN IA 


g 


2. The only idempotent s-norm is Bg. 
3. The only s-norm satisfying a Ba = 1 for all a €]0, 1] is Da. 
Idempotent elements can be characterized as follows: 


Proposition 24 ([241]). a € [0,1] îs idempotent for Y iff for all b € [a, 1], 
a®b=min(a,b). If is continuous then a € [0,1] is idempotent for 8 iff for 
all x € [0,1] we have that a ® b = min(a, b). 


A t-norm & is called nilpotent if it is continuous and each a in the open 
interval (0,1) is a nilpotent element, i.e., there is a natural number n such 
that a8...8 a (n times) equals 0. a € (0,1) is called a zero divisor of Y if 
there exists b € (0,1) such that a & b = 0. 


Remark 16. 
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1. Each a € (0,1) is both a nilpotent element and a zero divisor of Y, and 
8a- 


2. Qg has neither nilpotent nor zero divisors and ®, and Sa have only 
trivial idempotent elements. 


3. The product Sp has neither non-trivial idempotent nor nilpotent ele- 
ments nor zero divisors. 


4. Ifa is an idempotent element, so is a”, i.e., a®...@a (n times) and, 
thus, no element in (0,1) can be both idempotent and nilpotent. 


5. Ifa is nilpotent for & then it is also a zero divisor for & and there is 
always b € (0,1) such that b&b = 0. 


6. Ifa € (0,1) is nilpotent (a zero divisor) for Y then each (0,a) is also a 
nilpotent (a zero divisor) for 8. Therefore, the set for nilpotent elements 
and the set of zero divisors for ® is either the empty set, as for Sy and 
Sp, or an interval of the form (0,c) or (0, c]. 


7. Qı is nilpotent. 

Proposition 25 ([241]). The following are equivalent: 
1. ® has zero divisors 
2. ® has idempotent elements 


A t-norm @® is called strict if it is continuous and strictly monotone, i.e., when- 
ever a > 0 and b < c then 
a®b<a®c. 


For instance, 9, is strict. It is known that: 
Proposition 26 ([241)). 
1. If © is strictly monotone then it has only trivial idempotent elements. 


2. If ® is strictly monotone then it has no divisors (i.e., c®y = 0 if either 
x =0 ory=0). 


8.1.3.2 Dual Norms 


There is a well-known property that allows us to derive a related s-norm from 
any t-norm. 


Proposition 27 (See [241]). A function & is an s-norm iff there is a t-norm 
8 such that for all (a,b) € [0,1]? we have that 


a9b=1-(1-a)8 (1-9). (8.11) 
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An s-norm as defined by Equation (8.11) is called the dual s-norm of Y. Vice 
versa, from any s-norm ® we may define a t-norm Y via 

agb=1-(1-a)0 (1-0). (8.12) 


A t-norm as defined by Equation (8.12) is called the dual t-norm of O. 
It is easily verified that for any of the t-norms ®; and s-norms $, defined 
above (i € {g,1,p,d}) we have that for all (a,b) € [0,1]? 


aib = 1-(1-a)8:(1-b) (8.13) 

ab = 1-(1-a)@;(1-)). (8.14) 

Given ®, 6, and ©, we call a tuple (9, $, ©) a dual triple iff for all a,b € [0, 1] 
o(a8b) = Sageb (8.15) 

e(aeb) = Ca@cb. (8.16) 


These equations describe the de Morgan’s laws for fuzzy sets. 
It is easily verified that 


(8g, Bg, 1) 
(@1, Di, ©1) 
(Sp, Dp» O1) 
(Sa, Da, O1) 


are dual triples. The following can be shown (see, e.g., [242]) 
The following can be shown (see, e.g., [242]) 


Proposition 28. 
1. (94, 99,9) and (Sa, Ba, S1) are dual triples for any ©; 
. (9,0,91) is a dual triple for any dual s-norm 8 of 8; 


2 

3. (9,8,91) is a dual triple for any dual t-norm 8 of O; 

4. If (@,®,©) is a dual triple satisfying the law of excluded middle and 
the law of contradiction, then (89, 9,09) does not satisfy the distributive 
laws. 

8.1.3.3 Distributive Norms 

Given 9, O, we say that @ is distributive over © iff for all a,b,c € [0, 1] 
a®(b@c) = (a®b)@(a@c) (8.17) 
and that © is distributive over ® iff for all a,b,c € [0,1] 
a®(b@c) = (a@b)@(aGc). (8.18) 


If both Q is distributive over © and 4 is distributive over € then the tuple 
(8, ®) is called a distributive pair. 
The following can be shown (see, e.g., [242]) 
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Proposition 29. 

1. O is distributive over Q iff ® = 8g. 

2. ® is distributive over © iff O = Dg. 


3. (Qg, Dg) is the only distributive pair. 


8.1.3.4 T-Norm Representation Theorem 


A major achievement about t-norms is the fact that any continuous t- 
norm is piece-vice isomorphic to either Gódel, Lukasiewicz, or product t-norm, 
making these three t-norms among the most important of the t-norm family. 
We address this property within this section. 

A t-norm ® is called Archimedean if it has the Archimedean property, 
i.e., if for each a,b in the open interval (0, 1) there is a natural number n such 
that a 8... 8 ais less than or equal to b. Note that €, is not Archimedian, 
while ®; and Q, are. 


Proposition 30 ([241)). 


1. If Q is right continuous and has only trivial idempotent elements then 
it is Archimedian. 


If & is strict then it is Archimedian. 
If each a € (0,1) is nilpotent for ® then ® is Archimedian. 
If & is Archimedian then Y is left-continuous iff Y is continuous. 


. © is Archimedian iff the only idempotent elements are 0 and 1. 


D a os y 


. A continuous Archimedean t-norm is strict if 0 is its only nilpotent ele- 
ment; otherwise it is nilpotent. 


The following proposition characterizes Archimedian t-norms: 


Proposition 31 ([241)). 
Let & be an Archimedian t-norm. The following are equivalent: 


1. Q is nilpotent. 
2. There exists some nilpotent element for Q. 
3. There exists some zero divisor for ®. 


4. Q îs not strict. 
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Now, note that for each continuous t-norm ®, the set E of its idempotents 
is a closed subset of [0, 1]. Its complement, the set of all elements which are 
not idempotent, is therefore a union of a set Zopen(£) of countably many 
non-overlapping open intervals I € Z(E). Let us define Z(E) as the set: 


[a,b] € T(E) iff (a,b) € Topen(E) . 
For I € Z(E), let us denote with ®); the restriction of ® to J. The follow- 


ing proposition characterizes fully continuous t-norms and is also called the 
Mostert-Shields theorem [321].1 


Proposition 32 ([186, 241]). Let & be continuous. Then 
1. for each I € T(E), ® is isomorphic to an Archimedian t-norm; 


2. ifa,be [0,1] are such that there is no I € T(E) such that a,b € I, then 
a Qb = min(a, b). 


The above proposition tells us that any continuous t-norm ® behaves as an 
Archimedian t-norm within an interval I € Z(E), while for two points not 
included within any interval I € Z(E), 8 is @,. 

Now, it is well-known that 


Proposition 33. For any continuous Archimedian t-norm (Y we have that: 
1. if 9 is strict then ® is isomorphic to Sp; 


2. if Q is not strict, i.e., by Proposition 31, ® is nilpotent, then ® is iso- 
morphic to Qi. 
Hence, combining Propositions 32 and 33 we get 
Proposition 34 ([186, 241]). Let & be continuous. Then 


1. for each I € T(E), 81 is isomorphic to either product or Lukasiewicz t- 
norm; 


2. ifa,be [0,1] are such that there is no I € T(E) such that a,b € I, then 
a&b = min(a, b). 


Eventually, we conclude with the representation theorem of Mostert-Shields. 
So, let us introduce the notion of ordinal sum. Let (&a)aca be a family of 
t-norms and ((aa,ba))aca be a family for non-empty, pairwise disjoint open 
sub-intervals of [0, 1]. Then the following function 


da + (ba — aa): (25 Sa (25) if (a,b) € (aa, ba) X (aa, ba); 
min(a, b) otherwise . 


is a t-norm and is called the ordinal sum of summands (aa, ba, Da), A E A. 
Putting Propositions 32 - 34 we get the Mostert-Shields characterization 
of continuous t-norms: 


lWe say that a t-norm Qi is isomorphic to a t-norm @2 iff there is a strictly increasing 
function f such that a @1 b = f7*(f(a) 82 f(b)). 


116 Foundations of Fuzzy Logic and Semantic Web Languages 


Proposition 35 (Mostert-Shields t-norm characterization [241]). The fol- 
lowing are equivalent: 


1. 8 îs a continuous t-norm; 


2. Q is uniquely representable as an ordinal sum of continuous Archimedean 
t-norms; 


3. there is a uniquely determined countable family ((da,ba)) cea 
empty, pairwise disjoint open sub-intervals of [0,1] such that 


of non- 


(a) if ® is strict on I = (aa,ba) then jr is isomorphic to product 
t-norm Sp; 


(b) if S is not strict, i.e., nilpotent, on I = (aa,ba) then Or îs iso- 
morphic to Lukasiewicz t-norm @1; 


(c) if a,b € [0,1] are such that there is no I = (aa,ba) such that 
a,b E€ I, thena®b=a®, b. 


We may quote the above proposition as: 


“A t-norm is continuous iff it is isomorphic to an ordinal sum of 
Gödel, Lukasiewicz, and product t-norm” , 


which explains why Gödel, Łukasiewicz, and product t-norm are considered 
the most important ones with fuzzy logic. A similar characterization is not 
known for non-continuous t-norms. 

We summarize: 


e ¿y is the only idempotent t-norm, and is neither Archimedian, nor strict 
nor nilpotent; 


e Q; is Archimedian and nilpotent, and is neither idempotent, nor strict; 
e p is Archimedian and strict, and is neither idempotent, nor nilpotent. 


The above properties are summarized in the following Table 8.4: 


TABLE 8.4: Some properties of t-norms. 


t-norm Archimedian idempotent nilpotent strict 
Sg ° 
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TABLE 8.5: Properties for implication and negation functions. 


Axiom Name Implication Function 
Boundary condition 0=>b=1 

a>1=1 

1>0=0 
Antitonicity ifa < b, then a > c> b>c 
Monotonicity ifb<c,thnasb<a>c 


8.1.4 Fuzzy Implication 
In fuzzy set theory and practice, an important notion is the concept of 
fuzzy implication. A fuzzy implication, denoted =, is a function 


=: [0,1] x [0,1] > [0,1] 


and corresponds to a generalization of the notion of logical implication over 
{0,1} defined as a > b is 1 (true) iff a < b. 

As for t-norms, also for implication functions some axioms have to be sat- 
isfied and are illustrated in Table 8.5. Essentially, > should be not increasing 
in its first argument, while not decreasing in its second argument. 


In the literature, many implication functions have proposed (see, e.g., [242, 
Section 11.2]), among which the salient ones are: 


ts E E $1 ifa<b 
Gödel implication: a >, b= { ay ee 
Lukasiewicz implication: a >, b = min(1,1 — a + b). 


1 ifa<b 
duct implication: b= = 
Gougen or product implication: a >, { as 


Kleene Dienes implication: a za b = max(1 — a,b). 


Reichebach implication: a =, b = 1 — a + ab. 


b ifa= 1 
Drastic s-implication: a > , b = l-a ifb=0 

1 otherwise . 

b ifa=1 


Drastic r-implication: a => g, b = { L otherwise: 
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Besides the possibility to define an implication function directly, there are 
methods to define them indirectly. One such method consists of departing 
from a s-norm and a negation function and defines 


a=>b=0a00b, (8.19) 


which generalizes the classical implication =a Vb. These implications are called 
usually s-implications in the literature. 


Remark 17. It is easily verified that =ka, =r, =>, and as are all s- 
implications, all based on standard fuzzy negation O, and s-norms Ðg, Op, Pi 
and Ba, respectively. 


Similarly as for t-norms, we say that an implication > , is weaker than => iff 
a =>1 b < a =>2 b for all (a,b) € [0,1]. It can be shown that 


Proposition 36 ([242]). 


1. Given a negation O, then for s-norms 8 and 82 such that 8 


Let =>, and =>2 their corresponding s-implications. Then =>, 
holds. 


2. The following relation between S-implication holds: 


kd S rw a S ds + 


Another method, based on a t-norm only, gives rise to so-called r-implications 
and is defined as follows. Let ® be a continuous t-norm. Then the r-implication 
w.r.t. ©, is defined as 


a=>b=sup{c|a@c< b}. (8.20) 


Remark 18. It is easily verified that >¿,>1,>p and >a, are all r- 
implications, based on t-norms Bg, Pi, Pp and Ga, respectively. >) is the only 
implication that is both an s-implication and an r-implication. Note also that 
a=b= 1l iffa< b and that 1 >b=b. 


It can be shown that 
Proposition 37 ([242]). 


1. Given t-norms 81 and ®2 such that 891 < 82. Let =>, and = 2 their 
corresponding r-implications. Then >, > =>2 holds. 


2. The following relation among r-implications holds: 


< S1 ar + 


An important property of r-implication is the following [186]. 
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Proposition 38. For a given continuous t-norm (Y there is an unique impli- 
cation a = b satisfying, for all a,b,c € [0,1], the condition 


a®c<biffasb>c (8.21) 
namely, the r-implication a > b = sup{c|a@c< c}. 


We point out that in fact, given a continuous t-norm & and its r-implication 
=, then we have that 


azb = a®(a=>b) (8.22) 
adyb = ((a>b)> 5) @, (06> 4a)> a), (8.23) 


which essentially allows us to define Gödel t-norm and s-norm, in terms of 
any continuous t-norm Y. 
We also can define a negation function from an r-implication as 


Sa = a=>0. (8.24) 
It easily verified that 
e Lukasiewicz negation is obtained from Lukasiewicz r-implication; 
e Gödel negation is obtained from Gödel, or product r-implication. 


Additionally, we have the following inferences we will use extensively in this 
book: let a > n and a > b > m. Then, 


With Kleene-Dienes implication, we have that “if n > 1 — m then 
b> m”. 


More importantly, to what concerns our work, is that under an r-implication 
relative to a t-norm 8, we have that 


from a > n and a > b > m, we infr b> ngm. (8.25) 


To see this, as a > n and a > b = sup {c | a 8c < b} =c > mit follows 
that b>aQc>n8m. In a similar way, under an r-implication relative to a 
t-norm 8, we have that 


from a > b > n and b > c > m, we infer that a > c> ngm. (8.26) 


A final concept related to implications is related to the definition of inclusion 
degree among a fuzzy sets. We have seen that the standard inclusion among 
a fuzzy sets A, B € 2% is crisp (see Equation (8.10)): either A C B or AZ B. 
We may fuzzify C in the following way. Let => be an implication functions. 
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TABLE 8.6: Some additional properties of combination functions of various 
t-norms. 


Property Lukasiewicz Gödel Product | Zadeh [472] 


r@Qor=0 . ° e 
Leor=l e 
LOr=2x 
TOTS T 
OOT=T 
T=>Y=O0TIODY 
z > y= Oy > OL 
S(1>y)=1800y 
9(180y)=9100y 
(19y)=0780y 


Then the degree of inclusion between fuzzy sets A,B € 2%, denoted A 3 B 
is defined as 
int A(x) > B(x). (8.27) 


We conclude this section with the following important remark. Table 8.6 
recalls some salient properties of the various settings. In the table, the column 
name identifies the t-norm 8, from which the dual s-norm $ is considered, 
according to Equation (8.11), and the negation function is determined via the 
r-implication by Equation (8.24). The “Zadeh” column is an exception and 
refers to the set of functions (99, Dg, Sa O1)- 

As we may see, none of them satisfy all the properties. More important 
is the fact that a set (8, O, ©) of functions satisfying all the listed properties 
in the upper part has necessarily to collapse to the Boolean, two-valued, case 
{0,1} [143]. As a note, [146] claimed that fuzzy logic collapses to Boolean 
logic, but didn’t recognize that to prove it, all the properties of Table 8.6 had 
been used. 


8.1.5 Fuzzy Relation 


We say that a binary fuzzy relation is a fuzzy set R € 2***. The inverse 
of R has membership function R7*(y, x) = R(x, y), for every x,y € X. The 
composition of two fuzzy relations Ri and Ra is defined as (Ra o Ra)(x,z) = 
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supyex Rilz, y) 8 Roly, z). A fuzzy relation R is transitive iff R(x, 2) > (Ro 
R)(a, z). 


8.1.6 Aggregation Operators 


Aggregation Operators (AOs) (see, e.g., [443] and [242, Section 3.6]) are 
mathematical functions that are used to combine information. There exists 
a large number of different AOs that differ on the assumptions on the data 
(data types) and about the type of information that we can incorporate in the 
model. To what concerns us, an AO of dimension n is a mapping Q : R” > R 
that satisfies: 


1. Q(a) = a (idempotent if unary); 
2. @(0,...,0) =0 and Q(1,...,1) = 1 (boundary conditions); 
3. Q(az,...,4n) < @(bi,..., bn) if Vi, a; < bi (monotone). 
Note that we always have that 
min(@1,...,@n) < Q(a1,..., an) < max(ai,...,@n) . 


Often, an AO Q is parameterized with a vector of n weights W = [w1,..., Wn] 
such that w; € [0,1] and $; w; = 1. In that case we will denote the AO as 
Qw. 

Examples of AOs are the arithmetic mean, 


1 
QWE(27,...,4n) ==) a, (8.28) 
NE 


the weighted sum 
QW (a1,-.--;@n) = 5 Wili , (8.29) 


and the Ordered Weighted Averaging (OWA) operators and the quantifier- 
guided OWAs that we shall present next. 

The OWA operators [464, 466, 468] provide a parameterized class of 
mean type AOs. Formally, an OWA operator of dimension n is an AO such 
that 


O *(a1,...,4n) = X w3b;, (8.30) 
j 


where 6; is the j-th largest of the as. 

A fundamental aspect of these operators is the reordering step. In par- 
ticular, a weight w; is not associated with a specific argument but with an 
ordered position of the aggregate. By choosing different W we can implement 
different AOs. The OWA operator is a non-linear operator as a result of the 


122 Foundations of Fuzzy Logic and Semantic Web Languages 


process of determining the b;. An OWA operator @ is a mean operator and is 
also symmetric and idempotent: 


Symmetric : Q(az,..., An) = Q(Az(1), +++, Qr(n)) 
Idempotent : Q(a,...,a)=a 


where 7 is a permutation. Notable OWA operators are: 


F(ar,...,a,) =max(a1,...,an) for W = [1,0,...,0] 
F(ar,...,a,) =min(a1,...,a,) for W = [0....,0,1] 
F(az,...,an) =avg(ai,...,@n) for W = [1/n,1/n,...,1/n] 
f(a,...,an) =med(a1,...,a,) for wi =0, 


n odd and Wn+1)/2 = 1, or 
n even and W,/2 = 0.5 =W,/241 


In the equations above, avg and med are the average and median value of the 
ai, respectively. 

Next we recap quantifier aggregations [242, 465, 466, 467]. Classical logic 
has two quantifiers, the universal V and the existential 4 quantifier. These are 
extremal ones between several other linguistic quantifiers such as most, few, 
about half, some, many, etc. Quantifiers can be seen as absolute of propor- 
tional (see, e.g., [242]). To what concerns us, we consider the proportional 
ones. In this case, a proportional type quantifier, such as most, can be repre- 
sented as a fuzzy subset Q : [0,1] — [0,1] such that for each r € [0,1], the 
membership grade Q(r) indicates the degree to which the proportion r satis- 
fies the linguistic quantifier that Q represents. See Figure 8.7 for examples of 
fuzzy quantifiers. 

An important class of quantifiers are the monotone quantifiers that satisfy 
the following conditions: 


1. Q(0) =0, 


3. Q(r1) < Q(ra) if TI < To. 


Essentially, these quantifiers are characterized by the idea that as the propor- 
tion increases, the degree of satisfaction does not decrease. For two quantifiers 
Qi, Q2, we write Qi < Q2 if for all r, Qi(r) < Qa(r). We recall that the two 
classical quantifiers 3, Y may be defined as the monotone quantifiers Qy(r) = 0 
ifr #1 and Qa(r) = 1 if r 4 0, respectively. Note that for any monotone 
quantifier Q, we have that 


Qy <Q< Qs (8.31) 


dictating that V and J are indeed the lower and upper bounds of monotone 
quantifiers. 
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most 


about half 


Sa 


Almost all 


0 0.5 1 


FIGURE 8.7: Some examples of fuzzy quantifiers. 


To introduce quantifier-guided aggregation, we consider first the evaluation 
of quantified propositions. So, let Y be a (crisp) set of objects and B some 
fuzzy subset of Y. A quantified proposition is a statement of the form 


QY'sare B . 


An example of such a statement is “most students are young.” Yager [465] 
suggested an approach to evaluate the truth of such quantified propositions 
using OWA. In the following we will assume that Q is a monotone quantifier. 
The first step consists in associating to Q an OWA weighting vector Wo of 
dimension n = |Y|, where |S| is the cardinality of a denumerable set S. The 
weights are obtained as 


w=0b-a). (8.32) 


Note that for Q monotone, wj € [0,1] and >’, wj = 1. Now, we can obtain 
the truth t of the statement “QY’s are B” as 


t = 07 (01,-.-; Un) (8.33) 


where a; = B(y;), yi € Y and Owe is the OWA operator obtained using the 
weights Wg found from @ as above. 

We turn now to the issue of quantifier-guided aggregation, that has been 
shown to be useful for evaluating multi-criteria decision problems [38]. So, 
assume X to be a set of alternatives and we have a collection of n criteria 
A; that are of concern in a given problem. For x € X, with A(x) we indi- 
cate the degree to which the i-th criterion is satisfied by alternative zx. It is 
easily verified that the degree to which all criteria are satisfied by x can be 
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determined by D(x) = min(4:(x),..., An(x)) while the degree to which some 
criteria is satisfied by z can be determined by D(x) = max(41(x),..., An(2)). 
In general, for a monotone quantifier Q, the degree to which 


Q of the criteria are satisfied by z 


can be formalized as 


D(x) = Owo (A1(2), ..., An(£)) , (8.34) 


where Owe is the OWA operator obtained using the weights Wg found from 
Q. 
Thus, by selecting an appropriate weighting vector W, we are essentially 
implementing a kind of quantifier-guided aggregation. 
We refer the reader to [443] for an in-depth discussion on the matter. 


8.1.7 Matrix-Based Fuzzy Set Operations 


We have seen that t-norms play an important role in the specification 
of fuzzy set operations such as intersection, union, and negation. A main 
feature is that they rely on some reasonable axioms one expects such fuzzy set 
operations should satisfy. In this section, we provide another setting, especially 
addressed by Hahnle [181, 182, 183, 184] related to many-valued logics, and 
appears to be particularly interesting for practical cases. In fact, we replace the 
set of membership degrees [0,1] with the finite one Ln = {0, = Gien z2, 1} 
(see Equation (8.6)). 

This is not surprising as usually computers cannot deal with [0,1], but 
rather with Ln for a reasonable large n. 

In this setting, a fuzzy set operation, called fuzzy combination function, is 
simply defined as k-ary function 


Fen) F Daa (8.35) 


So, for instance, intersection and union (resp. &,®) may be seen as binary 
functions, which may be extended to the n-ary case, while set complement 
(9) is an unary function. 

As Ln is finite, one way to represent a fuzzy combination function f is 
by means of a matrix Mp with k + 1 columns and n* rows, where the first 
k columns are the arguments of the function and the last column contains 
the evaluation of the function on the arguments. As for k arguments, each of 
which may have n values, the number of rows of the matrix is nf. Of course, 
if the a binary function f : Ln x Ln is commutative, i.e., f(x,y) = f(y, x) 
then we need not to represent all rows. 
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For instance, for n = 5, the matrix for standard negation is 


0o 1 
0.25 0.75 

Mo,=| 05 05 
0.75 0.25 

1 0 


Similarly, the t-norm @; can be represented as 


0 o0 0 
0 025 0 
0 05 0 
0 0.75 0 
0 1 0 
0.25 0 0 
0.25 0.25 0 
0.25 05 0 
0.25 0.75 0 
0.25 1 0.25 
05 0 0 
0.5 0.25 0 
Me =| 05 05 0 


In a similar way, we can represent the t-norm Sy, and the s-norms Q/ 
and yg. 

It is worth noting that the product t-norm ®, and s-norm ©, cannot 
be finitely represented, unless n = 2 (the product - is not closed on Ln for 
any n > 3). So, for n > 3 a matrix representation of the product is some 
approximation x - y of a real product x - y, which is exactly what happens 
usually on computers. For instance, we may decide to approximate &, via a 
rounding function, e.g., we round x - y to the closest value in Ln and in case 
of a tie, we round to the lower value: specifically, for x, y € Ls, we may define 
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tn y via the matrix 


0 0 0 
0 0.25 0 
0 0.5 0 
0 0.75 0 
0 1 0 
0.25 0 0 
0.25 0.25 0 
0.25 0.5 0 
0.25 0.75 0.25 
0.25 1 0.25 
0.5 0 0 
0.5 0.25 0 


1 0 0 
1 0.25 0.25 
1 0.5 0.5 
1 0.75 0.75 
1 1 1 


Note that -n is commutative, monotone, satisfies the boundary conditions, but 
is not associative, thus, -, is not a t-norm: e.g., 


0.25 -n (0.75 -n 0.75) = 0 4 0.25 = (0.25 -n 0.75) +, 0.75 . 


Another, more direct, approach to deal with fuzzy combination functions is to 
rely directly on built-in functions provided by an underlying computer system, 
rather than relying on matrix representation. 


8.1.8 Fuzzy Modifiers 


Fuzzy modifiers are an interesting feature of fuzzy set theory. Essentially, 
a fuzzy modifier, such as very, more_or_less, and slightly, apply to fuzzy 
sets to change their membership function. 

Formally, a fuzzy modifier m represents a function 


fim: [0,1] > [0,1] . 


For example, we may define frery(x) = x? and fetigntiy(v) = vz. In this way, 
e.g., by referring to Example 24, we may express the fuzzy set of employees 
having a very low salary by applying the modifier very to the fuzzy member- 
ship function of “low,” i.e., 


Lvery low(2) = Fuery(M1ow(2)) = (Hiow(x))* = (Is(2000, 2500) (2))? . 
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FIGURE 8.8: Linear modifier Im(a, b). 


The typical shape of modifiers we will consider here are so-called linear mod- 
ifiers, as illustrated in Figure 8.8. 

Note that such a modifier can be parameterized by means of one parameter 
c only, i.e., lm(a, b) = Im(c), where 


a=c/(c+1),b=1/(c+1). 


8.2 Mathematical Fuzzy Logic Basics 


Given that SWLs are grounded on mathematical logic, it is quite natural 
to look at Mathematical Fuzzy Logic [186] to get inspiration for a fuzzy logic 
extensions of SWLs. So far, we have put some effort on basic notions of fuzzy 
set theory. Mathematical fuzzy logic is based on fuzzy sets as much as classical 
logic is based on crisp set theory. 

The aim of this section is to recap salient notions of mathematical fuzzy 
logic. An in-depth discussion can be found in [186]. 


8.2.1 From Classical Logic to Mathematical Fuzzy Logic 


In the setting of many-valued logics (see also [172, 183]), the convention 
prescribing that a statement is either true or false is changed and is a matter 
of degree measured on an ordered scale that is no longer {0,1}, but [0,1]. 
The conceptual shift reflects the shift from classical crisp sets to fuzzy sets. 
For example, the compatibility of “tall” in the phrase “a tall man” with some 
individual of a given height is often graded: the man can be judged not quite 
tall, somewhat tall, rather tall, very tall, etc. Changing the usual true/false 
convention leads to a new concept of statements, whose compatibility with a 
given state of facts is a matter of degree and can be measured on an ordered 
scale S that is no longer {0,1}, but, e.g., the unit interval [0, 1]. This leads to 
identifying “fuzzy statements” ¢@ with a fuzzy set of possible states of affairs; 
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the degree of membership of a state of affairs to this fuzzy set evaluates the 
degree of fit between the statement and the state of facts it refers to. This 
degree of fit is called degree of truth of the statement ¢ in the interpretation Z 
(state of affairs). Many-valued logics provide compositional calculi of degrees 
of truth, including degrees between “true” and “false that correspond to the 
fuzzy set operations of conjunction, union, and negation. So, a sentence is 
now not true or false only, but may have a truth degree taken from a truth 
space S, usually [0,1] or Ln for an integer n>3. In the sequel, we assume 
S = [0,1]. 

In this section, fuzzy statements have the form (¢,r), where r € [0, 1] (see, 
e.g., [183, 186]) and ¢ is a FOL statement, which encodes that the degree of 
truth of ¢ is greater or equal r. FOL statements are (see, e.g., [67, 93, 158, 
167, 378]), inductively defined from atomic formulae as usual (where ¢ and y 
are FOL formulae): 


PAV, Vy, 74, 60%, 796, 


Jx.p , VIP. 


Semantically, a many-valued interpretation T is similar to a classical interpre- 
tation, which rather than mapping any atomic statement into {0,1}, maps 
now maps each atomic formula A into [0,1] and is then extended inductively 
to all statements as follows: 


T(gAv) = L$) OT) 

T(V) = T(P) SDT) 

Z(p—v) = L$) =T) 

I(povy) = Toy) ely?) (8.36) 
T(—9) = eZ(0) 

T(3z.9) = SUPaeaz Li (0) 

T(V2.p) = infuecarTi(ó), 


where A? is the domain of Z, the interpretation Z% is as Z except that 
x is mapped into a, and now, not surprisingly, Y, 6, >, and © are a t- 
norm, s-norm, implication function, and a negation function, respectively. 
This allows us to extend the classical Boolean conjunction, disjunction, im- 
plication, and negation, respectively, to the many-valued case. Often, the im- 
plication function = is defined as r-implication (see Section 8.1.4), that is, 
a = b = sup {c |a &c< b}, but this need not be the case. 
We also consider here the following abbreviations: 


HAY E alo > y) (8.37) 
Veh E (>Y) => 4) ^ (Y => $) > Y) (8.38) 
-00 = 630. (8.39) 
(8.40) 
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TABLE 8.7: Combination functions of various fuzzy logics. 


Lukasiewicz Logic Godel Logic Product Logic SFL 
agb max(a +b — 1,0) min(a, b) a-b min(a, b) 
apb min(a + b, 1) max(a, b) a+b=a-b max(a, b) 
1 ifa<b 
a=>b min(1 — a+ 6,1) min(1, b/a) max(1 — a, b) 
b otherwise 


1 ifa=0 1 ifa=0 
Sa l-a f | l-a 
0 otherwise 0 otherwise 


In case > is the r-implication based on ®, then Ag (resp. Vg) is interpreted as 
Gédel t-norm (resp. s-norm), while —g is interpreted as the negation function 
related to Y (see also Equation (8.24)). 

As for fuzzy set theory, where we have seen that Lukasiewicz, Gódel, and 
product t-norm are the major players (see Proposition 35), in mathemat- 
ical fuzzy logic, one usually distinguishes three different logics, namely 
Lukasiewicz, Gödel, and product logics [186], whose combination functions 
are reported in Table 8.7. In the table, we call Standard Fuzzy Logic (SFL), 
the logic with fuzzy combination functions ®,,6,,0, and impfxa. By Equa- 
tions (8.37) and (8.38), in fact standard fuzzy logic is entailed, e.g., by 
Lukasiewicz logic, as min(a,b) = a &; (a =; b) and max(a,b) = 1 — min(1 — 
a, — b) and, thus, is usually not considered in the mathematical fuzzy logic 
literature. 

It is also worth nothing that these logics satisfy the same properties as illus- 
trated in Table 8.6 and that there is no fuzzy logic satisfying all the properties 
listed in this table (otherwise it has to collapse to classical logic). 

A fuzzy interpretation Z satisfies a fuzzy statement (¢,r) or T is a model 
of (¢,r), denoted ZF (¢,r) iff Z(¢) > r. We say that two formulae ¢ and y 
are equivalent, denoted w = ¢, if for any fuzzy interpretation Z we have that 
T(0) = T(w). We say that Z is a model of ¢ if Z(0) = 1 and ¢ is a tautology iff 
all fuzzy interpretations are models of ¢, i.e., y = 1. A fuzzy knowledge base 
(or simply knowledge base, if clear from context) is a set of fuzzy statements 
and an interpretation Z satisfies (is a model of) a knowledge base, denoted 
TERK, iff it satisfies each element in it. 

We say (¢,n) is a tight logical consequence of a set of fuzzy statements K 
iff n is the infimum of Z(6) subject to all models Z of K. Notice that the latter 
is equivalent to n= sup {r | K — (¢,r)}. n is called the best entailment degree 
of @ w.r.t. K (denoted bed(K, ¢)), i.e., 


bed(K, e) = sup {r| K E (¢,r)} . (8.41) 
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On the other hand, the best satisfiability degree of d w.r.t. K (denoted 
bsd(K, $)) is 
bsd(K,6) = supt 79) |Z EX) . (8.42) 


Of course, the properties of Table 8.6 immediately translate into equivalence 
among formulae. For instance, the following equivalences hold (in brackets we 
indicate the logic for which the equivalence holds) 


~ = ¢ (L) 
^b = q (G) 
(PAG) = 1 (£,G, I) 
$v- = 1 (E). 


We would like to point out some crucial difference between fuzzy logic and 
classical FOL, not illustrated so far. 


Remark 19. Unlike the classical case, in general, we do not have that Vx.o 
and 2312.54 are equivalent. They are equivalent for Lukasiewicz logic and SFL 
(see later on, Remark 21), but are neither equivalent for Gódel nor for Product 
logic. For instance, under Godel negation, just consider an interpretation T 
with domain {a} and Z(p(a)) = u, with 0 < u < 1. Then T(Vx.p(1)) = u, 
while T(=3x.p(1)) = 1 and, thus, Vx.p(1) £ =31.p(x). 

On the other hand, it can be shown that (see later on, Remark 21), e.g., 


Vi.) = 31.4 . 


We conclude, by pointing out that if the connective — is interpreted as t- 
implication w.r.t. the t-norm 8, then the following graded deduction theorem 
is valid: 


KU{(¢,7), (6 > Y, 8)} E (Vr 8 8) . (8.43) 


In fact, consider an interpretation Z satisfying K, (¢,r) and (4 > y, s}. Then, 
from r < Z(9), from s < I(¢ > y) = T(¢) > L(y), from the monotonicity of 
8, and from Equation (8.21), we get 


Tw) > T(d)@s>r@s. 


This suggests also that the following graded deduction rule is correct: 


(9,7), (9 > Y, 8) 
(res) ` 


(8.44) 
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8.2.1.1 On Witnessed Models 


We say that a fuzzy interpretation Z is a witnessed interpretation iff 


I(3x.p) = I%(¢), for some a € A? (8.45) 
I(Va.p) = I%(@), for some ac A? . (8.46) 
That is 
I(31.9) = max I%(¢) 
ac At 
I(Va.¢) = min I%(¢) . 
ac At 


These equations say that the supremum (resp. infimum) are attained at some 
point for a witnessed interpretation. Now, unlike the classical case, it may not 
be true that Equation (8.45) and Equation (8.46) hold for any Z, i.e., T may 
not be witnessed. For instance, for Z with domain the natural numbers and 
T(A(w)) = 1 — 1/n, we have that 


T(31.A(x)) = sup, (A(2)) Sup 1-1/n=1, (8.47) 


while in no point Z(A(x)) is 1. So, Z is not witnessed (the argument for V is 
similar). 

The following important property can be shown (see, e.g., [186, 188, 189, 
190]). 


Proposition 39. In Lukasiewicz logic and, thus, in SFL, a fuzzy statement 
(6,r) has a witnessed fuzzy model iff it has a fuzzy model. This is not true for 
Godel and product logic. 
This proposition says that, for Lukasiewicz logic, we may restrict our attention 
to witnessed models only. That is, Lukasiewicz has the so-called witnessed 
model property (there is a model iff there is witnessed model). 

Of course, we also have that 
Proposition 40. If the truth space is finite then any fuzzy logic has the 
witnessed model property. 


Interesting is that (see [188]) 
Vaz. A(x) Aix. A(z) (8.48) 


has no classical model, but it has a fuzzy one, which has to be infinite. Indeed, 
in [188], it is shown that in Gödel logic it has no finite model, but it has an 
infinite fuzzy model. Of course, the above formula has no classical model. To 
see that it has a model under Gödel logic, consider the fuzzy interpretation Z 
with domain the natural numbers and Z?(A(x)) = 1/(n + 1). Then 


T(Va.A(x)) = inf Z;(A(2)) = 0, and, thus, Z(-Vx.A(x)) = 1 
T(3x.A(2)) sup Z% (2A(x)) = 0, and, thus, Z(231.A(1)) =1. 


II 


Therefore, Z is a model of ¢. 
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8.2.2 Reasoning 


We refer the reader to, e.g., [182, 183, 186] for reasoning algorithms for 
fuzzy propositional and first-order logics. For the purposes of this book, we 
illustrate here some salient approaches that turn out to be useful later on. 
Except for Section 8.2.2.1, where we present Hilbert style axiomatizations, all 
the other methods will refer to the propositional case only, which however is 
sufficient to illustrate the main concepts behind the reasoning methods. 


8.2.2.1 Axiomatizations 


At first, we start with a the typical Hilbert style axiomatization ap- 
proach [186]. The purpose of the axiomatization is to characterize tautologies, 
i.e., p is provable from the axioms iff g is a tautology, that is ¢ = 1. The 
axioms have also the benefit to allow to get a better understanding of a fuzzy 
logic, but is unlikely suitable for any use in a practical scenario. 

For the rest of this section, we restrict the logical connectives to A and >, 
where A is interpreted as a t-norm ® and — is its relative r-implication. Ag 
is a shorthand for $ — 0. Atomic formulae are restricted to be propositional 
letters. Additionally, a theory 7 is a set of formulae ¢, a model of T is a fuzzy 
interoperation being a model of all formulae in 7, and 7 entails ¢, denoted 
T = 6, iff any model of T is a model of ¢. Of course, we have that 


TF o iff (1) | VET} E (1) . 


Essentially, we focus our attention to fuzzy statements true to degree 1. 

As usual, with 7 | ( we denote the fact that a formula can be derived 
from a theory using the theory, a set of axioms’, and the deduction rule. With 
T Ha $ we denote that the formula ¢ is classically entailed by T. 


Basic Logic BL. At first, consider the following axioms of the so-called BL- 
logic: 
(A1) (¢> 0) > (Y > x) > (> x) 


( 

(A2) (^Y) > ¢ 
( 
( 


(A3) (910) > (YA 9) 

(A4) (9A(9= 0)) > (YA (1 4)) 

(A5a) (6 > (1 > x)) > ((9 A) > x) 
(A5b) (($1 ¥) > x)) > (^ (Y => xX) 

(A6) (¢A (b> x)) > (W > 4) >) >) 
(A7) 0+ ¢ 


2The set of axioms may vary from case to case and is indicated via a subscript. 
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(Deduction rule) Modus ponens: from ¢ and ¢ > Y infer y. 


We have that the following soundness and completeness result for BL-logic, 
where the t-norm can be arbitrary: 


Proposition 41 ([186]). Consider an arbitrary t-norm ®. Then T Fa & iff 
T = 9. Also, if T Fu 9 then T Ra 0, but not vice versa (e.9., Ea $ V 0, 
but Fa dV ~o). 


Note also that 


e T Fa ($ Ang) >0 
e T Fau ¢ > ~no, but T YBL 250 > 9, e.g., 6 = pV =p, t-norm is Gödel 


e Tha ($ > 4) > (AW > 6), but not vice versa. 


Lukasiewicz Logic E. We have seen that Proposition 41 provides us a sound- 
ness and completeness result for the BL-logic, in which we allow any t-norm 
to be considered to interpret the logical connective A. 

As next, let us fir the t-norm to be ®), ie., we are considering 
Lukasiewicz logic. Consider the following axiomatization: 


(Axioms) Axioms of BL 
(E) 9 > 6 
(Deduction rule) Modus ponens: from ¢ and ¢ > Y infer y. 


We have that the following soundness and completeness result for 
Lukasiewicz logic: 


Proposition 42 ({186]). Fix the t-norm to Qı. Then T +i o iff T 


l 
S 


Remark 20. Note also that 

$ >) & (54 > 79) 

e EE(PAD) & (HOV =y) 

e F (9 => Y) e (GAY) 

e F (9> Y) e ovy) 

e Hi (=($ > Y) e ($ ^=) 

e SFL is entailed by Lukasiewicz logic. 


e 
al 


Product Logic II. Let us fix the t-norm to be Sp, i.e., we are considering 
product logic. Consider the following axiomatization 
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(Axioms) Axioms of BL 

(IL) ——x > (Ax > AX) > ($ > Y) 

(112) (9,70) >0 

(Deduction rule) Modus ponens: from ¢ and ¢ > Y infer y. 


We have that the following soundness and completeness result for product 
logic: 


Proposition 43 ([186]). Fix the t-norm to 8p. Then T Fp pif T 


l 
S 


Note also that 
e Fp =p Ap) > 7G Ag Y) 
e F, (9 >=0) > 4 


e Tp ~no Vg =0. 


Gödel Logic G. Let us fix the t-norm to be Sy, i.e., we are considering 
Gödel logic. Consider the following axiomatization 


(Axioms) Axioms of BL 


(G) = (610) 
(Deduction rule) Modus ponens: from ¢ and ¢ > Y infer y. 


We have that the following soundness and completeness result for Gödel logic: 


Proposition 44 ([186]). Fix the t-norm to @,. Then T F ¢ iff T 0. 


Note also that 


e Fy ($ => 74) > 76 


Boolean Logic. One may ask how the various axiomatizations relate to 
Boolean, classical, two-valued propositional logic. To this end, consider the 
following axiomatization 


(Axioms) Axioms of BL 
(B) Q Vg 9 
(Deduction rule) Modus ponens: from ¢ and ¢ > Y infer y. 


We have that the following soundness and completeness result for classical 
Boolean logic: 
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Proposition 45 ([186]). Fix interpretations to be classical, two-valued. Then 


T Ebool (0) aff T i o. 


Note that Boolean logic extends Gödel logic as 


e Fool p > (pA $). 


Note also that the union of any two of our three main logics turns out to be 
equivalent to the classical Boolean logic: 


e L + G is equivalent to Boolean logic 


e L + II is equivalent to Boolean logic 


e G + II is equivalent to Boolean logic. 


Rational Pavelka Logic RPL. So far, we considered only formulae and 
their absolute truth, i.e., whether they have truth 1 in any model of a theory. 
In the following, we devise an axiomatization for the case of fuzzy statements, 
i.e., statements of form ($,r) under Lukasiewicz-logic. To do so, we introduce 
the so-called Rational Pavelka Logic (RPL) [347]. 

In this setting, we fix the t-norm to be @;, and we extend the so far 
considered propositional language to include, besides 0 and 1, also any rational 
r € [0,1] as atom in a formula. We denote the set of rational numbers in [0, 1] 
with [0, 1]g. Hence, e.g., (0.349) > 0.8 is a formula. We extend interpretations 
to rationals in the obvious way: Z(r) = r. We also assume that any sub-formula 
in which rationals occur only is recursively replaced with its evaluation using: 


TrAs H>H ros (8.49) 
ros > rss (8.50) 
ap ee? esp, (8.51) 


The axiomatizations of RPL is as for Lukasiewicz-logic and we have: 


Proposition 46 (see [186]). For RPL, T Frp ¢ iff T H ¢. 


The interesting point is the following observation: for any fuzzy interpretation 
T, for any formula ¢ and rational r € [0, 1] 
T(9) >riffZ(r > 4) =1 

T(o) < riff T(d>r)=1. 
That is, Z is a model of (4, r) iff it is a model of r > ¢, and, similarly, Z is 
a model of (0, r) iff it is a model of dp > (1 — r). As a consequence, from 
Proposition 46 we get immediately a sound and complete proof procedure for 
graded consequence under Lukasiewicz logic (and, thus, for SFL): for a set of 
fuzzy statements K, let 


Tk=1ftr>vlyvEeK)t. 
Then 
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Proposition 47 (see [186]). For a set of fuzzy statements K, 


K = (6, r) iff Tic Frar >o. 


The analogue of Proposition 47 fails for product logic and Gödel logic 
(see [186]). We refer the reader to [148, 149] for more on this issue. 

However, we may somewhat extend RPL, still guaranteeing soundness and 
completeness [186]. In fact, consider RPL extend with a binary logical combi- 
nation operator A, interpreted as product t-norm ®,. Let RPL(A,) this logic. 
Consider the evaluation of rationals as from Equations (8.49)-(8.51), plus the 
additional evaluation 


rT Ap St> TQpS, 
and, eventually, the axiomatization 
(Axioms) Axioms of RPL 
(RITL) (9 > Y) > (( ^p x) > (1 Ap X) 
(RII2) (9 > Y) > (x Ap $) > (x Av Y) 
(Deduction rule) Modus ponens: from ¢ and ¢ > Y infer y 
Then 
Proposition 48 ([186]). T Fpa) $ if T E o. 


Fuzzy First-Order Logic. Here we show how to provide an axiomatization 
of fuzzy FOL logic. In the following, we extend the propositional language by 
allowing predicates to occur as atomic formulae and quantified formulae are 
the form Vz.¢ and 32.4. A term t is either a variable or a constant.’ 

In the following, with CV, where C € { BL, E, II, G}, we denote the fuzzy 
FOL in which the underlying logic is either the BL logic, the Lukasiewicz logic, 
the product logic or the Gódel logic. Now, consider the following axiomatiza- 
tion 


(Axioms) Axioms of C 


(v p(x) > (t) (t substitutable for x in p(x)) 


1 


o(t) > 31.p(1) (t substitutable for x in ¢(x)) 


32 


1) Y 
(31) 
(Y2) Va.( > 9) > ( > Vr.0) (x not free in Y) 
(32) Va.(¢ > Y) > (32.4 > Y) (x not free in Y) 


3Note that we may introduce functions symbols as well, with crisp semantics (but is 
uninteresting), or we need to discuss also fuzzy equality (which we leave out here). 
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(V3) Vx.(9 Vg Y) + (Vx.P) Vg Y (x not free in Y) 
(Modus ponens) from ¢ and ¢ > w infer y 


(Generalization) from ¢ infer Vx.d, 


” 


where the notions of “t substitutable for x in p(x)” and “x not free in y” are 
defined as for classical FOL (see, e.g., [67, 93, 158, 167, 378]).* 

We have that the following soundness and completeness result for the 
CV logic: 


Proposition 49 ([186]). T Fe ¢ iff T Ed. 


Remark 21. Note that 


e rBL da.@ = Vx. 


o Fary =3x.p o> Vx.0 


1.0 O VI.) 


e tery (Vz.0— 6) 6 db > (Vz.0) if x does not occur in d 


e “priv (Gr. > 6) 6 6 > (2.0) if does not occuring . 


Predicate Rational Pavelka Logic (RPLV). The FOL variant is obtained 
from RPL in the obvious way: we fix Lukasiewicz t-norm and r-implication, 
formulae are as for LV, where rationals r € [0,1]g may appear as atoms and, 
formulae with rationals are only evaluated according to Equations (8.49)— 
(8.51), and a sound and complete axiomatization is the one as for LV. 


8.2.2.2 Operational Research-based 


An important and application oriented approach for reasoning with fuzzy 
logic is based on Operational Research (OR). To this end, we first recap some 
salient definitions related to OR. 


OR basics. In this section we recall basic notions related to Mixed Integer 
Linear Programming (MILP), Mixed Integer Quadratically Constrained Pro- 
gramming (MIQCP) and Mixed Integer Non Linear Programming (MINLP) 
optimization problems. 

A general Mixed Integer Linear Programming (MILP) [366] problem con- 
sists in minimizing a linear function with respect to a set of constraints that 
are linear inequations in which rational and integer variables can occur. More 
precisely, let x = (x1,..., £k} and y = (y1,...,Ym) be variables over Q and Z 
respectively, over the integers and let A, B be integer matrices and h an integer 


4Roughly, a constant is always substitutable, and a variable y is substitutable into ¢ for 
x if the substitution does not make any free occurrence of x into a bound occurrence of y. 
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vector. The variables in y are called control variables. Let f(x,y) be an k+m- 
ary linear function. Then the general MILP problem is to find z € Q*,y e Z™ 
such that f(2,y) = min{ f(x,y) | Ax + By > h}. 

The general case can be restricted to what concerns the paper as we can 
deal with bounded MILP (bMILP). That is, the rational variables usually 
range over [0, 1], while the integer variables range over 40, 1}. It is well known 
that the bMILP problem is NP-complete (for the belonging to NP, guess the y 
and solve in polynomial time the linear system, NP-hardness follows from NP- 
Hardness of 0-1 Integer Programming). Furthermore, we say that M C [0, 1]* 
is bMILP-representable iff there is a bMILP (A, B,h) with k real and m 0-1 
variables such that M = {a : dy € (0, 1)” such that Ax + By > h}. 

In general, we require that every constructor is bMILP representable. 
For instance, classical logic, SFL, and Lukasiewicz connectives, are bMILP- 
representable, while Gódel negation is not. In general, connectives whose 
graph can be represented as the union of a finite number of convex poly- 
hedra are bMILP-representable [216], however, discontinuous functions may 
not be bMILP representable. 

There are a lot of available tools for solving these problems, such as Cbc5, 
Gurobi® or lpsolve”. 

Concerning Mixed Integer Quadratically Constrained Programming 
(MIQCP), let x = (%1,...,8x) and y = (yi,...,Ym) be variables over Q 
and Z respectively. Now, for all ¿ € 0,1,...,n, let a; be an integer vector 
of length k, b; be an integer vector of length m, h; be an integer number 
and Qi(z,y) = 1/2- (x + y)" -C; - (x + y), where C; is a symmetric inte- 
ger matrix of dimension (k + m) x (k + m). Let f(x,y) be an k + m-ary 
linear function. The MICQP problem is to find z € Q*,y € Z™ such that 
F(z=,y) = min{ f(x,y) : ao: £+ bo: y > ho) subject to a set of n constraints of 
the form: a; - x + bi- y +Q;(2,y) > h; or a; -x+ bi- y + Qi(x, y) < hi, for all 
i= 1,...,n. Notice that the objective function is linear, while the restrictions 
can contain quadratic sections. 

The general case may be restricted to the bounded MIQCP (bMIQCP), 
with rational variables ranging over [0,1] and integer variables range over 
10,1). M C [0,1] is bMICQP-representable iff there is a bMICQP 
(ai, bi, Ci, hi) with k real and m 0 — 1 variables such that M = {a : dy € 
10, 1) such that a; x + bi: y +Qu(x,y) > hi or as: 12+b¡-y+Qu(x, y) < hi. 

This problem is known to be NP-Hard. Some examples of solvers are 
CPLEX® or Mosek?. 

Concerning Mixed Integer Non Linear Programming (MINLP), let z = 
(21,...,2) and y = (Yy1,---;Ym) be variables over Q and Z respectively, 
and, for all i € 0,1,...,n, let h; be an integer number, and f;(x,y) be 


Shttp://www.coin-or.org/projects/Cbc.xml 
Shttp://www.gurobi.com 
Thttp://sourceforge.net/projects/lpsolve 
Shttp://www.ilog.com/products/cplex/ 
http: //www.mosek.com/ 
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an k + m-ary (possibly non linear) function. The Mixed Integer Non Lin- 
ear Programming (MINLP) problem [160] is to find z € Q*,y € Z” such that 
fo(Z,¥) = Minsez yeyifolx, y)} subject to a set of n constraints of the form: 
filz, y) > hi, for all i = 1,...,n. 

As in the previous cases, in the bounded MINLP (bMINLP), rational vari- 

ables range over [0,1] and integer variables range over {0,1}. The problem is 
NP-Hard, and there some available solvers, such as Bonmin!0. 
OR-based decision algorithm. Now, unlike Section 8.2.2.1, we show here 
a simple method that allows to determine bed(K, ¢) and bsd(K, 9) via Mixed 
Integer Linear Programming (MILP) for propositional Lukasiewicz logic and, 
thus, for SFL as well (see, e.g., [359]). To this end, it can be shown that 


bed(K,p) = minz. such that K U {(~ġ,1 — 2)) satisfiable (8.52) 
bsd(K,¢) = maxz. such that KU {(¢,x)} satisfiable . (8.53) 


In fact, e.g., concerning bed(K, p), suppose the minimal value of x is i. We will 

know then that for any fuzzy interpretation Z satisfying the knowledge base 

such that Z(¢) < ñ, the starting set is not satisfiable (otherwise A wouldn't be 

minimal) and, thus, Z(¢) > ñ has to hold, which means that bed(K, $) = n. 
Note that in effect Equation (8.52) is the same as 


bed(K,¢) = minz. such that KU {(¢ < x)} satisfiable. (8.54) 
Without loss of generalization, for Lukasiewicz logic, we may consider a for- 


mula ¢ in Negation Normal Form (NNF) (i.e., negation may be in front of an 
atom only), which is obtained by applying recursively the rules 


30 = 1 

~] = 0 

mo > e 
(PA) > =V- 
avy) > APA? 
AP) > PAR. 


With nnf(¢) we denote the NNF obtained from ¢. 

For a formula ¢ consider a variable x¿ (with intended meaning: the de- 
gree of truth of ¢ is greater or equal to xg). Now we apply the following 
transformation o that generates a set of MILP in-equations: 


bed(K, $) = min z. such that x € [0,1], 
Tag 2 1- T, o(=4), (8.55) 
for all (Y, n} € K, zy > n,a(w), 


lO http: //www.coin-or.org/projects/Bonmin.xml 
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where the o(-) is defined as follows: 


Lp E [0,1] if o=p 
zo=0 if $=0 
xi=1 if p = 
me Le Fasa 
xy € [0,1] 

a(¢) = 
Lg, Y Tha = To, if o=¢o A Q2 


7(01),0(92), 14 E [0,1] 


Toi ® Uo. = To, 3 = 
a(¢1),0(¢2),2¢ € [0,1] if $= ġı V ġ2 


Marion) > Lg. = Té, i _ 
o(nnf(7¢1)), 0(¢2), ze € [0,1] if =. 


In the definition above, the conditions 11 ® £2 = Z, £1 @% = Z, £1 > T2 = Z 
with 0 < z;,z < 1, can be encoded as the sets of constraints: 


e 119/12 = Z > {£1 +£2—1 < 2z,£1+£2—1 > z—y,z < 1-y, y € {0,1}}, 
where y is a new variable. 


e xı Oi T2 = Z > {T1 + £2 < Z +y,Y < 2,11 + 22 > 2,y € {0,1}}, where 
y is a new variable. 


ery >ı T2 = Z > {(1 — 21) O £2 = Z}. 


As the set of constraints is linearly bounded by K and as MILP satisfiabil- 
ity is NP-complete, we get the well-known result that determining the best 
entailment /satisfiability degree is NP-complete for propositional Lukasiewicz 
logic and SFL. 

For completeness, we illustrate here also the equations for propositional 
SFL alone as well: 


© ii Qg T2 = Z> {z < 01,2 < £2, %1 E Z+y, £2 < z+(1—y),y € {0,1}}, 
where y is a new variable. 


e 21 Bg T2 = 2 {2 > 41,2 > £2, £1 +Y > 2,22 +(1—y) > 2,y € (0,1)), 
where y is a new variable. 


e zi Sha T2 = Z > (1 — 21) Og T2 = Z. 
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Concerning the best satisfiability problem, in a similar way we derive 


bsd(K, $) = max z. such that x € [0,1], 
Te > 2,0 (0), (8.56) 
for all (Y, n) € K, £y > n,o(w) , 


while we may determine whether a KB K is satisfiable by checking whether 
the set of constraints 


sat(K) = {xy > n,0(4) | (n) € K} (8.57) 


has a solution. 

It is illustrative to show that the method based on OR in which we com- 
pile a KB into a set of constraints, can be represented as well via an inference 
method on formulae, as illustrated by the E-ORFuzzySat algorithm (see Al- 
gorithm 3). 

Then, it can be shown that 


Proposition 50. Let K be a propositional fuzzy KB K in Eukasiewicz logic. 
Then 


1. The L-ORFuzzySat procedure terminates. 
2. K is satisfiable iff the E-ORFuzzySat procedure returns true. 


Note that a model Z of K can immediately be built from a solution by assigning 
to any atom p occurring in K 


T(p) = Ip, 


where gp is the value of variable x, is a solution to Cx. We also have in this 
model that 


I($) = 25 
for any formula ¢ occurring in K. 
We illustrated the method for the KB satisfiability problem. It is quite 


straightforward to adapt it to best entailment and the best satisfiability prob- 
lems as well via Equations (8.55) and (8.56). 


Example 25. Consider 


K = {(p, 0.8), (p > q,0.9)) . 


Let us show that 
bed(K, q) = 0.7 . 
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Algorithm 3 L-ORFuzzySat. 


Input: A propositional fuzzy KB K in Lukasiewicz logic. 
Output: Check if K satisfiable and if positive provide a model of K 


1. Transform a KB K into NNF, i.e., by transforming the formulae in K into 
NNF; 


2. Initialize the fuzzy theory Tk and the initial set of constraints Cx by 


Tk = {¢|(¢,n) EK} 
Ck = {ty 2n| (o,n) EK}. 


3. Apply the following inference rules until no more rules can be applied. Of 
course, each rule instance is applied at most once. 


var). For variable xg occurring in Cx add xg € [0, 1] to Cx. 
var). For variable x¿ occurring in Cx add ry = 1 — tg to Cr. 
. If 0 € Tx then Cx := Cx U {20 = 0}. 
. If 1 € Tk then Cx := Cx U {x1 = 1}. 
A). If HAVETK, then 
(a) add ¢ and y to Tx 
(b) Cx :=Cx U {2g O Ly = Tony). 
(Vv). If OV Y € Tk, then 
(a) add $ and y to Tx 
(b) Cx := Cr U {tg D £y = Lory}. 
(>). If 6 > y € Tx, then 
(a) add nnf(-7¢) and y to Tx 
(b) Cx = CR UI I= Ennf(r0)) > Ty = Too). 


( 
( 
(L) 
(T) 
( 


4. Return that K is satisfiable, i.e., true, iff the final set of constraints Cx has 
a solution. 


under Lukasiewicz logic. To do so, we need to determine 


bed(K, q) = minz. such that x € [0,1], 
Lig 212, 
l — Za = Lg) 
Lp > 0.8, 
Lp+q > 0.9, 
l — fp = Tap, 
Tap + Lq E pq +Y, 
YS pq» 
Tap F Tq > Tp>q, 


y € {0,1}. 


I can be verified that the minimal value of x is 0.7. 
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The case of L,. We conclude this section by pointing out that the L- 
ORFuzzySat procedure can easily be adapted to a case in which we consider 
the set Ln as truth space in place of [0, 1]. In fact, we have to simply enforce 
that now, any [0, 1]-valued variable x takes values in Ln instead. Now, 

1 n-2 


0e 
z € {0, —3> m1? 


1} 


can be encoded as 
z=(n-1).x 
z € {0,1,... n — 1} 


for a new integer variable z. Hence, for L,, we immediately have the analogue 
of Proposition 50: 


(8.58) 


Proposition 51. Let K be a propositional fuzzy KB K under Lukasiewicz logic. 
Then 


1. The £-ORFuzzySat procedure terminates. 


2. K is satisfiable w.r.t. Ln iff the L-ORFuzzySat procedure returns true, in 
which in the final set of MILP constraints Cx the [0, 1]-valued variables 
are enforced to take value in L,, as illustrated in Equation (8.58). 


It is yet unclear if the L-ORFuzzySat procedure can be adapted to the case in 
which other truth combination functions over £,, are considered. We do not 
address this further here. 


8.2.2.3 Analytical Fuzzy Tableau 


In this section we show that in fact there is a very simple decision procedure 
for the propositional KB satisfiability problem in SFL, without relying on OR. 
It is based on the following simple properties that hold in SFL: 


e if Z is model of (6 A w,n) then Z is a model of both (¢,n) and (w,n); 
e if Z is model of (4 V w,n) then Z is a model of either (¢,n) or (p,n). 
e T cannot be a model of both (p,n) and (p,m) ifn > 1—m. 


For the remainder of this section, w.l.o.g. we may consider SFL KBs in which 
any formula of the form ¢ —> w has been replaced with —g Vw. 
A clash is either 


e a fuzzy statement (0,7) with n > 0; or 
e a pair of fuzzy statements (p, n) and (=p, m) with n > 1—m. 


A KB is clash-free iff it does not contain a clash. 

The decision procedure for the KB satisfiability problem is illustrated by 
the SFD-TableauFuzzySat procedure described in Algorithm 4. 
It is easily verified that 
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Algorithm 4 SFD-TableauFuzzySat. 


Input: A propositional fuzzy KB K under SFL. 
Output: Check if K satisfiable. 


1. Transform a KB K into NNF, i.e., by transforming the formulae in K into 
NNF; 


2. Initialize the completion Sk = K; 


3. Apply the following inference rules to Sk until no more rules can be applied. 
We call a set of fuzzy statements Sx complete iff none of the rules below 
can be applied to Sx. Note that rule (V) is non-deterministic. 


(A). If (PAW, n) € Sk and {(¢,n), (Y, n)} Z Sk , then add both (6, n) and 
(p,n) to Sk. 

(V). If ($ V Y,n) € Sk and ((9, n), (Y, n) } N Sk =O, then add either (6, n) 
or (p,n) to Sk 

(>). If (o>Y,n) E Sk and (nnf(-0)vy,n) € Sk, then add 
(nnf(—9) V y, n) to Sk 


4. Return that K is satisfiable, i.e., true iff we find a complete and clash-free 
completion Sx of K. 


Proposition 52. Let K be a propositional fuzzy KB K. Then 
1. The SFD-TableauFuzzySat procedure terminates. 
2. K is satisfiable iff the SFD-TableauFuzzySat procedure returns true. 


Note that, unlike the L-ORFuzzySat procedure, the SFD-TableauFuzzySat 
procedure is non-deterministic as involves the non-deterministic rule (V) and, 
thus, in the worst case we have to generate exhaustively all possible (finite 
many) completions Sk of K. 


Example 26. Consider K of Example 25. Let us show that K is satisfiable 
under SFL. A clash-free completion of K is 


Sk = ((p, 0.8), (p > q, 0.9), (=p V q, 0.9), (q, 0.9) 
from which we immediately may build a model T with T(p) = 0.8, Z(q) = 0.9. 


It remains to show how to address the entailment problem, 2.e., to decide 
whether K [E (6, n) holds. To this end, it is easily verified that 


K H (9, n) iff KU {(¢ < n)} is not satisfiable, 


where an interpretation Z is a model of the expression (¢ < n) iff Z(9) <n. 
Given that we now have an additional construct, we will show that we 
have to ways to cope with it, that we will briefly illustrate. 
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The first method consists in extending the rules in the SFD- 
TableauFuzzySat procedure to deal with < expressions. To this end, rather 
than using expressions of the form (¢ < n), we allow expressions of the form 


($ n)* (8.59) 
where Z is a model of the expression (¢,n)* iff Z(¢) > n. Then, of course, 


KE (¢,n) iff CU((=0,1— n)T} is not satisfiable . (8.60) 


Now, we have to extend 

1. the notion of clash; and 

2. the inference rules to cope with expression of the form (¢,n)* 
That is, a clash is one of the following: 

1. a fuzzy statement (0,n) with n > 0; 

2. a fuzzy statement (0,n)*; 

3. a fuzzy statement (1,1)*; 

4. a pair of fuzzy statements (p, n) and (=p, m) with n > 1—m 

5. a pair of fuzzy statements (p,n) and (=p,m)* with n >1—m 

(p, n)" 


6. a pair of fuzzy statements (p,n)* and (=p, m) with n > 1- m 


7. a pair of fuzzy statements (p,n)* and (=p,m)* with n > 1- m 


The additional rules to be considered within the SFD-TableauFuzzySat pro- 
cedure are: 


(A)*. I£(9 A b,n)* € Sk and {(¢,n)", (V, n)t} Z Sk , then add both (6, n)" 

and (w,n)* to Sk. 

(v)*. If (dV, n)* € Sk and {(d,n)t, (Y, n) FAN Sk = 0, then add either 
(6,n)* or (b,n)* to Sk. 

(=>). If (> y,n)t € Sk and (nnfí=p) V4d,n)* € Sk, then add 
(nnf(=4) V p,n)* to Sk. 


We call the procedure with these rules and clash definition SFD- 
TableauFuzzySat+ and have: 


Proposition 53. Let K be a propositional fuzzy KB. Then K E (¢,n) iff the 
SFD-TableauFuzzySat+ procedure returns true with input KU{(Ad, 1 —n)T}. 
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We may also solve the best entailment degree problem by relying on the fol- 
lowing important observation (see, e.g., [394, 395, 396]). It has been shown 
that, in case of propositional SFL, from a set K of fuzzy statements of the 
form (4, n) it is possible to determine a finite set NK C [0,1], where |N*| is 
O(|K|), such that bed(K, 6) € NK, i.e., the best entailment degree of a formula 
p w.r.t. K has to be an element of NK. 


Proposition 54 ([394]). Let K be a propositional fuzzy KB in NNF under 
SFL and let ọ be a formula. Then bed(K, $) € NE and bsd(K, $) € NK, where 


NE = {0,0.5,1} U [n]|(6,n) €K}. 


Therefore, bed(K,¢) can be determined by computing the greatest value 
n € NE such that K F (¢,n). An easy way to search for this n is to or- 
der the elements of NK and then to perform a binary search among these 
values by successive entailment tests. Similarly, bsd(K, $) can be determined 
by computing the greatest value n € NK such that KU {(¢,n)} satisfiable. 
Note that in this latter case we may rely on the SFD-TableauFuzzySat pro- 
cedure. 

Another method that does not require consideration of new constructs of 
the form (¢,n)*, is to represent (¢,n)* with a fuzzy statement of the form 
(p,n +€), for a sufficiently small e (this idea has been, e.g., used in [390]). 
To choose such an e, we may take advantage of Proposition 54. In fact, let us 
define 

NE = NF u{l—n|neNn*}. (8.61) 


Now, we define e as half of the minimal absolute difference among the values 
in NEB. That is, 


e = min{d/2| n,m € N*,n4m,d=|n—ml}. (8.62) 
Now we have that 


Proposition 55. Let K be a propositional fuzzy KB. Then for n > O 
KE (p,n) if KU {(n¢,1—n+)} is not satisfiable . 
Moreover, K is satisfiable iff it has a model over the truth set NX. 


Proposition 55 allows us now to use the SFD-TableauFuzzySat procedure in 
place of SFD-TableauFuzzySat+ to decide both the entailment problem and 
the best entailment degree problem. 

Note also that a model Z for K can be built as follows. Determine a clash- 
free completion Sx of K. Then build a model Z by assigning Z(p) = n if 
(p,n) € Sk and for which there does not exist another (p,m) € Sk with 
m>n. 

We complete this section, by showing that in fact the SFD- 
TableauFuzzySat and SFD-TableauFuzzySat+ procedures can further be gen- 
eralized to the Lukasiewicz logic case over Ln. The main idea is to revert all 
rules to non-deterministic rules in the following sense: 
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e if Z is model of ($ A 4,n) then Z is a model of both (¢,n1) and (Y, na) 
for some n1, n2 € Ly, such that n Qı n2 = n; 


e if Z is model of (dV; Y, n) then Z is a model of both (¢,n1) and (4, na) 
for some n1, n2 € Ln such that n O na =n. 


Note the close relationship between the above properties and the (A) and (V) 
rule in the L-ORFuzzySat procedure, in which we used variables xg and ry 
in place of the values nı and na instead. 

In fact, the following proposition shows that, using the fuzzy operators of 
Lukasiewicz fuzzy logic to combine two truth degrees a and b, no new degrees 
can appear. 


Proposition 56. Let => € Ln. Then, under the fuzzy operators 
of Lukasiewicz fuzzy logic, Op a qq 8 —, ai 4 — > — 
Ln [351]. 


Let L-TableauFuzzySat procedure be as the SFD-TableauFuzzySat procedure, 
in which we replace the rules with 


(A). If (dAv,n) E Sk, nina E Ln such that m Sy ng = n and 
(0, 1), (w,n2)+ Z Sk , then add both (¢,n1) and (4%, nı) to Sk. 

(V). If ($6Vy,n) € Sk, nina E Ln such that m O, no = n and 
[(9,n1), (Y, n2)} Z Sk , then add both (¢,n1) and (4, nı) to Sk. 

(>). If (ó>v,n) € Sk and (nnfitó0)Vy,n) € Sk , then add 
(nnf(=9) V Yn) to Sk. 


The LE-TableauFuzzySat+ procedure is determined from the SFD- 
TableauFuzzySat+ procedure, by adapting additionally the (A)* and (v)* 
rules in a similar way as we did above and we can show: 


Proposition 57. Let K be a propositional fuzzy KB K. Then 
1. The £-TableauFuzzySat procedure terminates. 


2. K is satisfiable according to Lukasiewicz logic over L iff the E- 
TableauFuzzySat procedure returns true. 


3. K RF (¢,n) according to Lukasiewicz logic over Ly iff the Ł- 
TableauFuzzySat+ procedure returns true with input KU[(20,1—n)*). 


Eventually, it is not difficult to see that indeed, the SFL-TableauFuzzySat can 
be modified to deal also with the case in which the truth combination functions 
are specified via a matrix, as illustrated in Section 8.1.7. Specifically, in line 
with Equation (8.35), let 

f: (La) > Ln 


be a k-ary truth combination function, represented via a matrix My with 
k + 1 columns and nf rows, where the first k columns are the arguments of 
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the function and the last column contains the evaluation of the function on 
the arguments. At first, in the following, we use expressions of the form 


(db, n)= (8.63) 


where Z is a model of the expression (¢,n)= iff Z(¢) =n. 
Now, we extend 


1. the notion of clash; and 
2. the inference rules to cope with expressions of the form (¢,n)>. 


A clash is a pair (¢,n1)= and (6, n2)= with nı Æ na, while the additional rules 
to be considered are of the following form. For any k-ary truth combination 
function f 


(My).  (f(d1,...,0x),n)— € Sk and there are n; € Lp (1 < i < k) such 
that f(n1,... nk) = n with (d;,n;)7 ¢ Sk, then add all (di, ni} to 
Sk. 

Of course, as the truth space Ln is finite, any rule (My) can be applied only 

finitely many times to an expression (f(¢1,...,@%),n)~ € Sk. 


Now, the M-TableauFuzzySat procedure derived from the SFD- 
TableauFuzzySat procedure is described in Algorithm 5. 


Algorithm 5 M-TableauFuzzySat. 
Input: A propositional fuzzy KB K with matrix-based truth combination functions. 
Output: Check if K satisfiable. 


1. Build non-deterministically a KB K’ such that 
(a) C'I < IKI 
(b) for each (ġ,n) € K there is (ġ, n’) € K’ with n' >n. 
(c) there is no pair (¢,n) and (¢,n’) in K’ such that n Zn. 


2. Initialize the completion Sk = K’; 


3. Apply the following inference rules to Sk until no more rules can be applied. 
We call a set of fuzzy statements Sx complete iff none of the rules below 
can be applied to Sx. Note that the rules are non-deterministic. For any 
k-ary truth combination function f, consider 


(Mp). If (f(¢1,.--,¢%),2) E Sk and there are n; € Ln (1 <i < k) such 
that f(n1,...,nx) =n with (di, ni) ¢ Sx, then add all ($i, ni) to 
Sk. 


4. Return that K is satisfiable, i.e., true iff we find a complete and clash-free 
completion Sx of K. 


It is easily verified that 
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Proposition 58. Let K be a propositional fuzzy KB K with matriz-based truth 
combination functions. Then 


1. The M-TableauFuzzySat procedure terminates. 
2. K is satisfiable iff the M-TableauFuzzySat procedure returns true. 


3. K = (p,n) iff the M-TableauFuzzySat procedure returns false for all 
KU((p, ny) with n' <n and Y € Ln. Here we assume that Step 1 of 
the M-TableauFuzzySat procedure adds (y,n')= to K' as well. 


Note that procedure M-TableauFuzzySat can straightforwardly be extended 
to any finite and arbitrary structured truth space L, such as a complete lattice, 
residuated lattice, etc. 


Corollary 4. Proposition 58 holds also in case matrix-based truth combina- 
tion functions are defined over any finite and arbitrary structured truth space 
L in place of In. 


Of course, there is a high amount of non-determinism in M-TableauFuzzySat 
and it remains to be seen if such a method is viable in practice (possibly 
involving several optimizations limiting the choices of the values). 


8.2.2.4 Reduction to Classical Logic 


In this section, our aim is to map both propositional SFL and 
Lukasiewicz logic knowledge bases into satisfiability and entailment preserv- 
ing classical propositional knowledge bases. An immediate consequence is 
then that we can rely on already implemented crisp SAT reasoners to rea- 
son within these two propositional fuzzy logics fuzzy. This idea has been used 
e.g., in [61, 396]. 

We illustrate first the case of SFL and then generalize it to 
Lukasiewicz logic. 

Before we are going to formally present the mapping, we first illustrate the 
basic idea we rely on. Our mapping relies on ideas presented in [36, 37] for 
so-called regular multi-valued logics. 

Assume we have a fuzzy KB 


K = [fa,,09,03,04) with 
a, = (A,0.4), 

ag = (54,0.3), 

ag = (5B,0.8), 

a4 = (=B,09). 


Let us introduce some new formulae, namely A>0.4, A<o.7, B<o.2 and B<o.1. 
Informally, the expression A>o.4 represents A is true to degree c > 0.4, while 
A<o.7 represents A is true to degree c < 0.7. Similarly, for the other formulae. 
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Of course, we have to consider also the relationships among the introduced 
formulae. For instance, B<o.1 —> B<0.2. This formula dictates that if B’s 
truth value is < 0.1 then it is also < 0.2. We may represent, thus, the fuzzy 
statements a, with the formula A>o.4. Similarly, ag may be mapped into 
A<o.7, a3 may be mapped into B<o.2, while ag may be mapped into B<o.1. 
This illustrates our basic idea. 

Formally we proceed as follows. Let K be propositional SFL KB in 
NNF in which all formulae of the form (¢ — 4,n) have been replaced with 
(nnf (=e V 4), n). 

Consider the set Ln. Without loss of generality, it can be assumed that Ln 
is ordered: Ln = [y1,..., Yn} and yi < Yi+1,1 < i < n — 1. It is easy to see 
that y, = 0 and y, = 1. We define L$ = {c € Lẹ : c 4 0). 

Let A be the set of fuzzy atomic propositions occurring in K. For each 
a, B € Ln with a € (0, 1] and 6 € [0, 1), for each A € A, two new atomic for- 
mulae A>a, A>g are introduced. As, represents the (non-fuzzy) proposition 
“A’s truth is > n”. A>g has similar meaning. The atomic elements A>1, A>0 
are not considered because they are not necessary, due to the restrictions on 
the allowed degree of the propositions in the fuzzy KB. 

The intended semantics of these newly introduced atomic formulae is pre- 
served by some implication formulae. For each 1 < i < n—1,2 < j < n—1 and 
for each A € A, Crispr, is the smallest set of classical formulae containing 
these two type of formulae: 


Azi > Ax, ) 
8.64 
Ady; > Ady; 


As next, we need to map non-atomic fuzzy formulae into crisp variants. We 
do this by reflecting the semantics of the logical connective. For instance, 
(A A B, 0.25) will be mapped into A>0.25AA>0.35 and similarly, e.g., (~A, 0.75) 
will be mapped into 450.25. 

Formally, the reduction of fuzzy formulae mapping p. 


Crispx = {p(¢,n) | (¢,n) € K} U (8.65) 
Crispy l 


n) 


where p(¢, n) is defined in Table 8.8: 
It is easily verified that |Crispx| is linearly bounded by |K] and the fol- 
lowing satisfiability preserving reduction property can be shown. 


Proposition 59. Let K be a propositional SFL KB. Then K is satisfiable iff 
Crispx is satisfiable, where Crispx is obtained according to Table 8.8. 


Concerning the entailment problem, we may easily reduce it to the satisfia- 
bility problem as, for c > 0, (24,1 — c)™ can be represented as (ng, 1 — c7), 
where c” is the next smaller value than c in Ly. 
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TABLE 8.8: Mapping of propositional SFL statements to crisp statements. 


y | p(x,y) 
c 
L OIT 
L c| Life>0 
A C Ase 
aA C Asie 
PAV | e | plg, c) A plo, e) 
VY lc | p(9,c)v plo, e) 


Therefore, by relying on Equation (8.60), for c > 0, we have 
KE (9,0) iff KU {(Ad, 1 — c~)} is not satisfiable , (8.66) 


where c” is the next smaller value than c in Ln and, thus, we may apply 
Proposition 59 to solve the entailment decision problem. Eventually, the best 
entailment degree problem can be reduced to calls to the entailment problem 
for various c € Ln, similarly as described in Section 8.2.2.3. 

We next show how the mapping above can be extended to the case of 
Lukasiewicz logic as well. To this end, we recall the property 


e if Z is model of ($ A 4,n) then Z is a model of both (¢,n1) and (4, na) 
for some n1, na € Ln such that ni Qi ng =n. 


The main difference now is we cannot simply map, e.g., (A A B,0.25) into 
A>0.25 A A>0.35, but we need to take into account all possible combinations of 
nı and na such that nı 9, n2 = 0.25 and, thus, (A A B,0.25) is rather mapped 
into 

V Ase, A Bey 

C1,C2 
for cu, € L} such that cı + c2 — 1 = 0.25. 

The mapping is shown in Table 8.9. 


Proposition 60. Let K be a propositional Lukasiewicz logic KB. Then K 
is satisfiable iff Crispk is satisfiable, where Crispk is obtained according to 
Table 8.9. 


As for the SFL case, both the entailment and the best entailment degree 
problems can be reduced to the satisfiability problem (see Equation (8.66)). 

Eventually, as for Section 8.2.2.3, we conclude by showing that the idea 
illustrated for Lukasiewicz logic can be adapted to the case of matrix-based 
truth-combination functions (see Section 8.1.7). At first, for y € L,, we intro- 
duce atomic formulae of the form 


Az, 
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TABLE 8.9: Mapping of propositional Lukasiewicz logic statements to crisp 
statements. 


z y | p(z, y) 
c 
L 0|T 
eli c|LlLife>0 
A C Ase 
A C AS ik 
PAV le | Vac P(B c1) A p(y, c2) 
for c1,C2 € L} such that a + c2 — 1 = c. 
Vy c P(c) V pl, V Vec plo, c1) A pl, ca) 
for c1,C2 € L such that cy + c2 = c. 


Informally, the expression A—, represents A is true to degree y. Then, Crispr, 
is the smallest set of classical formulae containing: 


Amara Am \ Aan > L (8.67) 


stating that A cannot have two different truth degrees. Now, the reduction 
of fuzzy formulae with truth combination functions defined via matrixes is 


defined as 
Crispx = {p(d,n') | (n) EK, n > n}u 


ein (8.68) 


Te 2 


where p(¢,n) is defined in Table 8.10, and f is a k-ary truth combination 
function 


ERE AR 
Then we have that 


TABLE 8.10: Mapping of matrix-based fuzzy propositional statements to 
crisp statements. 


x y | pl) 
< T 
T cllife<l 
ali OIT 
L cplifc>0 
A c | Az, 
Hor, --.,Bx) | © | Vier... er) (Olr) A «+. A pl be, ek))) 
for c; € Ln (1 < i < k) such that f(c1,...,Cx) =e. 


Fuzzy Sets and Mathematical Fuzzy Logic Basics 153 


Proposition 61. Let K be a propositional fuzzy KB with matriz-based truth 
combination functions. Then K is satisfiable iff Crispk is satisfiable, where 
Crispx is obtained according to Table 8.10. 


As for the SFL case, both the entailment and the best entailment degree 
problems can be reduced to the satisfiability problem (compare with Propo- 
sition 58). 


Proposition 62. Let K be a propositional fuzzy KB with matriz-based truth 
combination functions. Then K = (w,n) iff for alln! < n with n! € Ly, 
Crispk U (p(y,n')) is not satisfiable. 


Note that procedure M-TableauFuzzySat can straightforwardly be extended 
to any finite and arbitrary structured truth space L, such as a complete lattice, 
residuated lattice, etc. 


Corollary 5. Proposition 58 holds also in case matrix-based truth combina- 
tion functions are defined over any finite and arbitrary structured truth space 
L in place of In. 


8.2.3 Concrete Domains and Aggregation Operators 


In this section, we will show how we may extend fuzzy logic endowed with 
so-called fuzzy concrete domains [359, 398]. Specifically, we introduce the logic 
P(D), a fuzzy propositional Lukasiewicz logic under extended with concrete 
domains in order to handle numerical, as well as non-numerical features, al- 
lowing to express, for instance, e.g., I am searching for a passenger car costing 
about 25000€ yet if the car has a GPS system and more than two-year war- 
ranty I can spend up to 28000€. 

The notion is essentially an extension of the notion of concrete domains 
we have introduced for description logics (see Section 4.2.2.2). 

For instance, we may be able to express 


“I would like a passenger car with an alarm system if it costs more 


than 25000€” 
using the expression 
PassengerCar A ((price >25000) > AlarmSystem) 


In the formula above the expression (price >25000) acts as a so-called con- 
crete domain, i.e., the atom price has a fixed interpretation over a domain, in 
this case an integer Z(price) and (price >25000) will be true under an inter- 
pretation Z if Z(price) > 25000. Such expressions are called hard constraints 
in [359]. Similarly, we may express so-called soft constraints (or fuzzy con- 
straints) [359], we instead may be satisfied to a certain degree. For instance, 
we may be able to express 
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“if a GPS system is mounted on the car she can spend up to 
22000€for a sedan. The less I spend the more I’m satisfied. I’m 
definitely satisfied if I spend less than 18000€” 


using the expression 
GPS_system > (price 1s(18000, 22000)) . 


In the above expression, 1s(18000, 22000) is the well-known left-shoulder fuzzy 
set membership function we have seen in Section 8.1.1. Now, under un inter- 
pretation Z, the expression (price 1s(18000, 22000)) will have a degree of truth 
T((price 1s(18000, 22000))) determined by the evaluation of the left-shoulder 
function on the price, i.e., 


T((price 1s(18000, 22000))) = 1s(18000, 22000) (Z(price)) . 


We next provide the exact formulation. Similarly to Section 4.2.2.2, a fuzzy 
concrete domain, also called a fuzzy datatype theory D= (AP, P) consists of 
a datatype domain AP and a mapping -P that assigns to each data value 
an element of AP, to each datatype into a function from AP to [0,1]. The 
elementary datatypes we allow here are of two types: hard datatype and soft 
datatype. A hard datatype has one of the following forms: for any rational 
number n 


eae <n, n 9 
with semantics 
JP) = life >n, else 0 
(<,)P(z) = 1lifa<n, else 0 
(=,)P(z) = life=n, else0 . 


A soft datatype has one of the following form: 
Is(a,b), rs(a,b), tri(a,b,c), trz(a,b,c,d), 


with semantics (see Section 8.1.1) 


(Is(a, b))? (2) Is(a, b)(x) 
(rs(a,b))P (x) = rs(a,b)(x) 
(tri(a,b,c))P (£) = tri(a,b,c)(x) 
(trz(a, b,c, d))P (x) trz(a, b,c, d)(x) . 


Now, let F be an alphabet of feature names, distinct from the alphabet of 
propositional letters. A concrete atom is an expression of the form 


(f d) (8.69) 


11Recall that Z(price) is an integer. 
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where f is a feature name and d is a datatype. 
A formula is now inductively defined as follows: 


1. every propositional letter is a formula; 
2. every concrete atom is a formula; 


3. if 4 and ¢ are formulae, then so are 0, WA Q, YV O, Y > 0. As usual, 
we use Y in place of (Y > 6) A ( > Y); 


4. if d is a formula and w € [0, 1], then w- ¢ is a formula; 


5. if %1,..., Vx are formulae, then w1: Y1 +...+x :Yx is a formula, where 
€ [0,1] and $0, w; < 1; 


6. if ọ is a formula, then Im(a,b)(6) is a formula, where Im(a, b) is a fuzzy 
modifier (see Section 8.1.8); 


7. if @ is an aggregation function (see Section 8.1.6) and Y1,...,Yx are 


formulae, then Q(p1,...,4x) is a formula. 
A P(D) fuzzy statement is defined as follows: 


e if w is a formula and n € (0, 1] then (Y, n) is a fuzzy statement of P(D). 
If n is omitted, then (4, 1) is assumed. 


Eventually, a P(D) fuzzy knowledge base is a set of P(D) fuzzy statements. 
A fuzzy interpretation T w.r.t. a concrete domain D, 


1. maps each propositional literal into [0, 1]; 
2. maps each feature name into AP. 


T is inductively extended to formulae as follows: 


T((f d) = APS) 


L(y) 1—-Z(y) 
T(y Ag) Z(y) e L(¢) 
TYV e) Z(y) e Tle) 
T(Y > f) L(w) >, Z(6) 
T(w- $) w- (0) 
(Im(a.b)(6)) Im(a, b)(Z ne 
L(wi- yr +... + wr: Vr) Leet Wi) 


T(@Q(y1,..., r)) 


a (pr), Zr) - 


Eventually, the notions of a model of a fuzzy statement, of a knowledge base, 
and that of entailment, best entailment degree and best satisfiability degree 
are defined as in Section 8.2.1. For instance, Z is a model (satisfies) of a fuzzy 
statement (Y, n) iff Z(y) > n 
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Example 27 ([48, 359]). Assume, that a car seller sells a sedan car. A buyer 
is looking for a second-hand passenger car. Both buyers as well as sellers 
have preferences (restrictions). Our aim is to find the best agreement. The 
preferences are as follows. Concerning the buyer: 


1. 
2. 


He does not want to pay more than 26000 euro (buyer reservation value). 


If there is an alarm system in the car then he is completely satisfied with 
paying no more than 22300 euro, but he can go up to 22750 euro to a 
lesser degree of satisfaction. 


He wants driver insurance and either theft insurance or fire insurance. 


He wants air conditioning and the external color should be either black 
or gray. 
Preferably the price is no more than 22000 euro, but he can go up to 


24000 euro to a lesser degree of satisfaction. 


The kilometers warranty is preferably at least 175000, but he may go 
down to 150000 to a lesser degree of satisfaction. 


The weights of the preferences 2-6 are, (0.1, 0.2, 0.1, 0.2, 0.4). The 
higher the value the more important is the preference. 


Concerning the seller: 


. He wants to sell no less than 22000 euro (seller reservation value) 


If there is a navigator system in the car then he is completely satisfied 
with paying no less than 22750 euro, but he can go down to 22500 euro 
to a lesser degree of satisfaction. 


Preferably the buyer buys the Insurance Plus package. 


The kilometers warranty is preferably at most 100000, but he may go up 
to 125000 to a lesser degree of satisfaction. 


The monthly warranty is preferably at most 60, but he may go up to 72 
to a lesser degree of satisfaction. 


If the color is black then the car has air conditioning. 


The weights of the preferences 2-6 are, (0.3, 0.1, 0.3, 0.1, 0.2). The 
higher the value the more important is the preference. 


We have also some background theory about the domain: 


1. 


A sedan is a passenger car. 


2. A satellite alarm system is an alarm system. 
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3. The navigator pack is a satellite alarm system with a GPS system. 


4. The Insurance Plus package is a driver insurance together with a theft 
insurance. 


5. The car colors are black or gray. 
Now, the background knowledge can be encoded as: 


Sedan > PassengerCar 

Satellite Alarm > AlarmSystem 

Navigator Pack + (Satellite Alarm AGPS_system) 
InsurancePlus + (DriverInsurance ATheftInsurance) 
ExColor Black V ExColorGray 

ExColorBlack A ExColorGray — 0 


The buyer's preferences can be encoded as follows: 
1. BO (PassengerCar A (HasPrice <36000)) 
. By O (AlarmSystem > (HasPrice 1s(22300, 22750)) 
Ba + (DriverInsurance A (TheftInsurance V FireInsurance)) 
B; + (Airconditioning A (ExColor Black V ExColorGray)) 


By + (HasPrice 1s(22000, 24000)) 


. Bs o (HasK MW arranty rs(15000, 175000)) 


AA A w BW 


. Buy + (BA ((0.1- B1) + (0.2 - B2) + (0.1- B3) + (0.2 - B4) + (0.4 - Bs))) 


Please note that the concept Buy collects all the buyer’s preferences together 
in such a way that the higher the maximal degree of satisfiability of Buy 
(î.e., bsd(K, Buy)), the more the buyer is satisfied. 
The seller’s preferences can be encoded as follows: 
1. S = (Sedan A (HasPrice > 22000)) 
. Sı = (Navigator Pack > (HasPrice rs(22500, 22750)) 
„ S2 = InsurancePlus 
S3 = (HasK MW arranty 1s(100000, 125000)) 
. S4 = (Has MW arranty 1s(60, 72)) 
. S5 = (ExColor Black > AirConditioning) 
„ Sell = (SA ((0.3 - S1) + (0.1 - S2) + (0.3 - S3) + (0.1 - S4) + (0.2 - S5))) 


RDAs Co WD 


Similarly to the buyer case, the concept Sell collects all the seller’s preferences 
together in such a way that the higher is the maximal degree of satisfiability 
of Sell (i.e., bsd(K, Sell)), the more the seller is satisfied. 

Now, it is clear that the best agreement among the buyer and the seller is 
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determined by the mazimal degree of satisfiability of the conjunction Buy A 
Sell, i.e., we have to determine 


bsd(K, Buy A Sell) . 


In particular, in a model guaranteeing the maximal degree of satisfaction of 
Buy A Sell, we have that 


bsd(K, Buy A Sell) = 0.7 
HasPrice =  22000.0 
HasKMWarranty = 175000.0 
HasMWarranty = 0.0. 


Reasoning. We extend the OR-based KB satisfiability decision algorithm (see 
Section 8.2.2.2, Algorithm Ł-ORFuzzySat) to deal with concrete domains and 
aggregates. The extension to the best entailment degree and best satisfiability 
degree problems is as for Section 8.2.2.2. 

The crucial point is to show that we may map the additional constructs 
we have introduced here into sets of MILP constraints, that is, the newly 
introduced operators should be MILP representable, which is indeed the case. 
The rules can be found in Appendix F. 


8.2.4 On Fuzzy IF-THEN Rules 


One of the most important features of fuzzy logic is its ability to perform 
approximate reasoning [473], which involves inference rules with premises, 
consequences or both of them containing fuzzy propositions. A very popular 
formalism, due to their practical success, is a fuzzy rule-based system. 

A fuzzy IF-THEN system [242] consists of a rule base (a set of IF-THEN 
rules) and a reasoning algorithm performing an inference mechanism such as 
Rete [161]. In general, the input of the system is the current value for the 
input variable and the output is a fuzzy set, which can be defuzzified into a 
single value. In a fuzzy IF-THEN rule, its antecedents, consequences, or both 
are fuzzy. Fuzzy IF-THEN rules are fired to a degree, which is a function of 
the degree of match between their antecedent and the input. The deduction 
rule is Generalised Modus Ponens. 

Roughly speaking, given a rule “IF A THEN B”, where A and B are fuzzy 
propositions, it is possible from a premise “A'” which matches A to some 
degree, to deduce “B’”, which is similar to B. 

One of the most popular IF-THEN systems is the Mamdani model [303]. 
In a Mamdani model, fuzzy rules have the form 


IF X, IS A AND ... AND Xn IS A, THEN Y ISB, (8.70) 


where A; and B are linguistic values defined by fuzzy sets on universes of 
discourse X; and Y respectively, i = 1,...n. 
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For every clause in the antecedent of the rule, the matching degree between 
the current value of the variable and the linguistic label in the rule is com- 
puted (typically, using the minimum or another t-norm). If there exist several 
clauses, they are aggregated into a firing degree, using a fuzzy logic operator 
(typically, the maximum). Then, this firing degree is used for modifying the 
consequent of the rule using some function (typically the minimum). Some- 
times this function is referred to in the literature as an implication function, 
but this is a misleading term which should be avoided (e.g., minimum is not 
an implication function) [187]. 

Rules are fired using some inference algorithm. The computed conse- 
quences related to the same variable are aggregated (typically, using the max- 
imum). Then, the output variables can be defuzzified. Some examples of de- 
fuzzification methods are the LOM (Largest Of Maxima), SOM (Smallest Of 
Maxima), and MOM (Middle Of Maxima) [242]. More precisely, let B de- 
note the fuzzy set to be defuzzified and let x be an arbitrary element of the 
universe. Then for all x: 


e zLOM is the LOM iff yglrzom) > uglz) and, if ug(xrom) = plz) 
then tom > «x. 


zsom is the SOM iff uB(zsom) > wa(x) and, if uelusom) = unl(x) 
then zsom < t. 


e «vom is the MOM iff zuom = (trom + tsom)/2. 


Now, we show here how to use P(D) (see Section 8.2.3) to represent Mamdani 
fuzzy IF-THEN rules and to reason with them (differently from [3], which only 
allows to represent fuzzy controllers). For more insights, we refer the reader 
to [186]. The interesting thing is that it is not only possible to represent a 
fuzzy control problem, but also background knowledge related to it. 

In order to represent a fuzzy control problem we proceed as follows. Firstly, 
for each variable of the system, we define a concrete feature name f repre- 
senting it and specify its range (a subset of the reals [fmin, fmax]). Then, we 
define the different linguistic labels, which will be used to describe the value 
of these variables, using triangular fuzzy sets. 

Next, we define a formula resuming the rule base. It is well known [186] that 
a set {r1,...,?p} of Mamdani rules r; of the form (8.70) can be transformed 
in a formula Mamd of the form 


Mamd \ (Ai, Ag--- Ag Ain ^g Bi) (8.71) 


An alternative [186] consists in involving an r-implication to represent a rule. 
That is, a set ([r1,...,rp) of Mamdani rules r; of the form (8.70) can be 
transformed in a formula Rules of the form 

Rules A (Aa A... A Ain) > Bi) (8.72) 


in 
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Then, we represent the input of the system as fuzzy propositions. 


Example 28 ([48]). For the sake of concrete illustration, consider an adaption 
of the simple example available at http: //en. wikipedia. org/wiki/Fuzzy_ 
system. It considers the design of a fuzzy controller for an anti-lock brak- 
ing system. The encoding in P(D) is shown in Table 8.11. The system has 
two input variables, temperature and pressure, which are used to compute an 
output value which will adjust the throttle. Specifically, 


e temperature has 5 labels associated: cold, cool, nominal, warm, and hot. 
e pressure has 5 labels: weak, low, ok, strong, and high. 


e throttle has 7 labels: N3 (large negative), N2 (medium negative), N1 
(small negative), Z (zero), P1 (small positive), P2 (medium positive), 
and P3 (large positive). 


The rules of the system are the following: 


1. IF temperature IS cool AND pressure IS weak, THEN throttle is P3. 
2. IF temperature IS cool AND pressure IS low, THEN throttle is P2. 

3. IF temperature IS cool AND pressure IS ok, THEN throttle is Z. 

4. IF temperature IS cool AND pressure IS strong, THEN throttle is N2. 


The linguistic labels of the variables are defined using fuzzy concrete con- 
cepts. For example, TempCool, the label representing that the temperature 
is cool, can be defined as tri(480,500,520), ThrottleP2 as tri(10, 15,20) and 
ThrottleZ as tri(15, 20, 25). 

Now assume that the temperature is 489.6, which is represented as 
(temperature =ago.¢). Then, Hcoo (1481.92) = tri(480, 500, 520)(489.6) = 
0.48. We also assume that the pressure is 42.15, i.e., we have (pressure = 
42.15), and that prow(42.15) = 0.57, ox (42.15) = 0.25 and weak (42.15) = 
Lstrong(42-15) = Hhign(42.15) = 0. That is, the temperature is in the cool 
state, the pressure is in the low and ok states, in such a way that only rules 2 
and 3 are fired. 

Rule 2 is fired with a degree min{0.48, 0.57} = 0.48, so throttle is P2 with 
degree 0.48. Rule 3 is fired with a degree min{0.48, 0.25} = 0.25, so throttle 
is Z with degree 0.25. These two values for throttle are aggregated using the 
MALIMUM. 

Then, the defuzzification step is performed. It can be verified that LOM = 
17.6, MOM = 15, and SOM = 12.4. 

For concrete feature name throttle representing an output variable 
of the system, LOM is implemented in the following steps (instruction 
(defuzzify-lomMamd throttle) in Table 8.11). 


1. Compute the maximum degree y of satisfiability of Mamd, that is, we 
determine y = bsd(K, Mama). 
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TABLE 8.11: Fragment of a P(D) fuzzy KB example for fuzzy control. 


%feature names 
temperature 
pressure 
throttle 


% Labels for the variables 
TempCool := tri(480, 500, 520),... 
PressLow := tri(30, 40, 45) 
PressOK := tri(40, 48.6, 57.2),... 
ThrottleP2 := tri(10, 15, 20), 
ThrottleZ := tri(15, 20, 25),... 


% Encoding of Mamdami Rules 

Rule, + ((temperature TempCool) Ay 
Rulez + ((temperature TempCool) Ag 
Rulez + ((temperature TempCool) Ag 
Rules + ((temperature TempCool) Ag 


pressure PressWeak) Ay (throttle ThrottleP3)) 
pressure PressLow) Ag (throttle ThrottleP2)) 
pressure PressOK) Ag (throttle ThrottleZ)) 
pressure PressStrong) Ag (throttle ThrottleN2)) 


DOCO 


% Encoding of Mamdami Rule base 
Mamd = (Rule: Vg Rulez Vg Rulez Vg Rulez Vg Rules) 


% Input to the controller 
(temperature =489.6) 
(pressure =42.15) 


% Output of the controller 
(defuzzify-lom? Mamd throttle) 


2. Ensure that Mamd is true equal or greater than y, i.e., we add the fuzzy 
statement (Mamd, y) to the KB. 


3. Now, maximize the value of the (internal) variable xinrottie representing 
the value of throttle, i.e., determine 


Uthrottle := MAX Lthrottle S-t. KU{(Mamd,y)} satisfiable 


4. Output the result Vihrottle- 


The SOM can be computed in a similar way, but minimizing the value of the 
(internal) variable Linrottte representing the value of t. Finally, MOF can be 
computed as (LOM + SOM)/2. 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


Chapter 9 


Fuzzy RDF & RDFS 


9.1 Introduction 


RDF [361] has become a quite popular Semantic Web representation 
formalism. The basic ingredients are triples of the form (s,p,0), such as 
(tom, likes, tomato), stating that subject s has property p with value o. 

However, under the classical semantics, RDF cannot represent vague in- 
formation and, to this purpose, some Fuzzy RDF variants have been pro- 
posed [313, 314, 315, 429, 447, 448, 450]: essentially they allow us to state 
that a triple is true to some degree, e.g., (tom, likes, tomato) is true to degree 
at least 0.9.1 

Our main goal of this part is to provide, under very general semantics, a 
minimal deductive system for fuzzy RDF, along the lines described by [325]. 
That is, we essentially fuzzify the RDFS sub-language pdf, which is the core 
part of RDFS (see Chapter 3). To this end, the main conceptual shift parallels 
the one from crisp statements to fuzzy statements (see Chapter 8). Namely, 
rather than interpreting an RDFS triple (s, p, 0) being either true or false, we 
associate to it a degree of truth taken from the truth space [0,1]. Therefore, 
and not surprisingly, as we did for mathematical fuzzy logic (see Section 8.2), 
in which fuzzy statements were of the form (¢,n) with n € [0,1], in fuzzy 
RDFS we will have fuzzy triples of the form ((s,p,0),n) with the intended 
meaning “the degree of truth of the triple (s, p, 0) is equal or greater than n.” 


9.2 Fuzzy RDF é RDFS 


Syntax. To start with, we recall from Section 3.2 that a pdf triple is a triple 
T Of the form 
(s,p,0) € UBL x U x UBL, 


1We also refer the reader to [476, 265, 264, 427] to an even more general setting 
in which we may annotate a triple with an element of a so-called annotation domain, 
i.e., an idempotent and commutative semi-ring and the relative SPARQL query language 
AnQL http://anql.deri.org/. 


163 


164 Foundations of Fuzzy Logic and Semantic Web Languages 
where pdf is defined as the following subset of the RDFS vocabulary 
pdf = (sp, sc, type, dom, range} , 


supporting sub-property (sp), subclass (sc), typing (type), property domain 
restriction (dom), and property range restriction (range). 
Now, a fuzzy RDFS triple is an expression 


(7,n) , 


where 7 is a triple and n € [0,1]. The intended semantics is that the 
degree of truth of the fuzzy triple is not less than n. For instance, 
((audiTT, type, SportsCar), 0.8) is a fuzzy triple, intending that AudiTT is 
almost a sports car. In a fuzzy triple (7, n), the truth value n may be omitted 
and, in that case, the value n = 1 is assumed. 

A fuzzy RDFS graph G (or simply a fuzzy graph, or fuzzy RDF Knowledge 
Base) is a set of fuzzy RDF triples. 

The notions of universe of a graph G, the vocabulary of G, ground graph, 
and variable assignment are as for the crisp case (see Section 3.2). 

Without loss of generality we may also assume that there are not two fuzzy 
triples (7,n) and (7,m) in a fuzzy graph G. If this is the case, we may just 
remove the fuzzy triple with the lower score. 


Semantics. The semantics is a natural extension of the crisp one, in which 
the interpretation of properties and classes become fuzzy sets instead of crisp 
sets. 

In the remainder, let us fix a t-norm & and its dual O. A fuzzy interpre- 
tation T over a vocabulary V is a tuple 


L= (Ar, Ap, Ac, Az, PI], CL], 7) > 


where Ar, Ap, Ac, Ax are interpretation domains of Z and P[:], C[-], % are 
interpretation functions of Z. 
They have to satisfy: 


1. Ar is a nonempty finite set of resources, called the domain or universe 
of T; 


2. Ap is a finite set of property names (not necessarily disjoint from An); 


3. Ac C Ap is a distinguished subset of Ar identifying if a resource de- 
notes a class of resources; 


4. AL C Ap, the set of literal values, Az contains all plain literals in LAV; 


5. P[:] maps each property name p € Ap into a function Plp] : ARXAR > 
[0, 1], i.e., assigns a degree to each pair of resources, denoting the degree 
of being the pair an instance of the property p; 
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6. C[:] maps each class c € Ac into a function C[c] : Ar > [0,1], i.e., as- 
signs a degree to every resource, denoting the degree of the resource 
being an instance of class c; 


7. % maps each t € ULN V into a value t? € Ag U Ap and such that + 
is the identity for plain literals and assigns an element in Ar to each 
element in L. 


Note that the only difference so far relies on points 5 and 6, in which the 
extension functions become now fuzzy membership functions. 

The notion entailment is defined using the idea of satisfaction of a graph 
under certain interpretation. Intuitively a ground fuzzy triple ((s, p, 0), n) in 
a fuzzy RDF graph G will be satisfied under the interpretation Z if p is in- 
terpreted as a property name, s and o are interpreted as resources, and the 
interpretation of the pair (s, o) belongs to the extension of the property as- 
signed to p to a degree not less than n. 

Formally, an interpretation Z is a model of an annotated ground graph 
G, denoted T E G, if and only if Z is an interpretation over the vocabulary 
pdf U universe(G) that satisfies the following conditions: 


Simple: 
1. ((s,p,0),n) € G implies p? € Ap and Plp?](s%, 0%) > n; 
Subproperty: 
1. Plsp7](p, a) e Plsp”](a,r) < Plsp”](p,r); 
2. Pip iz, y) ® Plsp”](p,a) < Pla] (e, y); 
Subclass: 
1. P[sc*](c, d) ® P[sc*](d, e) < Pjs] (c, e); 
2. C[c*](x) e P[sc*](c, d) < Pla] (£); 
Typing I: 
1. Cld(e) = Pltype”] (e, 0) 
2. P[dom*](p, c) ® Plp](=, y) < Clel(z); 
3. Plrange”](p, c) ® Pp] (x,y) < Clel(y); 


Typing IT: 


1. For each e € pdf, e? € Ap; 

2. Plsp*](p, q) is defined only for p,q € Ap; 

3. Cl[sc?](c, d) is defined only for c,d € Ac; 

4. P[dom?] (p, c) is defined only for p € Ap and ce Ag; 
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5. Plrange*](p, c) is defined only for p € Ap and c € Ac; 
6. Pltype*](s, c) is defined only for c € Ac. 


It is easily verified that the classical setting is as the case in which the truth 
space is La = 40,1) in place of [0,1] (the t-norm can be arbitrary). 

It also easily verified that, e.g., the condition 1 of the subclass condition 
encodes the fuzzy FOL statement 


(Vz.[(e(z) > d(z)) A (d(x) > e(x))| > (e(z) > d(z)), 1) , 


that, by the way, can be derived via axioms (41) and (A5a) of the BL logic 
(see Section 8.2.2.1). The other conditions are derived similarly. 

Finally, entailment among annotated ground fuzzy graphs G and H is as 
usual. Now, G + H, where G and H may contain blank nodes, if and only if 
for any grounding G” of G there is a grounding H’ of H such that G’ E H’. 


Example 29. Suppose we want to state the following: Skype collaborators are 
also Ebay collaborators to some degree since Ebay possesses 30% of Skype’s 
shares, and also that Toivo is a part-time Skype collaborator: 


((SkypeCollab, sc, EbayCollab), 0.3) 
((toivo, type, SkypeCollab), 0.5) 


Then, e.g., under the product t-norm Sp, we can infer the following triple: 
((toivo, type, EbayCollab), 0.15) . 


Informally, as Toivo is a Skype collaborator and a Skype collaborator is an 
Ebay collaborator, then Toivo is an Ebay collaborator, too. Concerning the 
degree to which this statement holds, by Equation (8.25), we have that Toivo 
îs an Ebay collaborator to degree not smaller than 0.5 8p 0.3 = 0.15. 


Remark 22. Note that we always have that G = (7,0). Clearly, triples of the 
form (7,0) are uninteresting and, thus, in the following we do not consider 
them as part of the language. 


As for the crisp case, it can be shown that: 
Proposition 63 ((429]). Any fuzzy RDFS graph has a finite model. 


Therefore, we do not have to care about consistency. 


9.3 Fuzzy Conjunctive Queries 


Concerning query answering, we extend naturally the conjunctive queries 
defined in Section 3.3 and more extensively in Section 6.2 to the fuzzy setting. 
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As in Section 3.3, we assume that a fuzzy graph G is ground, that is blank 
nodes have been skolemized, i.e., replaced with terms in UL. 

Informally, queries are as for the classical case where triples are replaced 
with fuzzy triples in which fuzzy variables (taken from an appropriate alphabet 
and denoted A) may occur. We allow built-in triples of the form (s, p, 0), where 
p is a built-in predicate taken from a reserved vocabulary and having a fired 
interpretation, such as (A, <,l) stating that the value of A has to be < than 
the value l € L. We generalize the built-ins to any n-ary predicate p, where 
p’s arguments may be fuzzy variables, pdf variables, domain values of [0, 1], 
values from UL, and p has a fixed interpretation. We will assume that the 
evaluation of the predicate can be decided in finite time. As for the crisp case, 
for convenience, we write “functional predicates” as assignments of the form 
x:=f(z) and assume that the function f(z) is safe. We also assume that a 
non-functional built-in predicate p(z) should be safe as well. 

To start with, a fuzzy query is of the form 


(q(x), A) — 3y3A'.p(x, A, y, A’) (9.1) 


in which p(x, A, y, A’) is a conjunction (as for the crisp case, we use “,” as 
conjunction symbol) of fuzzy triples and built-in predicates, x and A are the 
distinguished variables, y and A’ are the vectors of non-distinguished variables 
(existential quantified variables), and x, A, y and A’ are pairwise disjoint. 
Variable A and variables in A’ can only appear in place of degrees of truth or 
built-in predicates. The query head contains at least one variable. 

For instance, the query 


(q(x), s) — ((a, type, SportsCar), s1), (a, hasPrice, y), s:=81 - cheap(y) , 


where, e.g., cheap(p) = 1s(10000, 1500)(), has intended meaning to retrieve 
all cheap sports cars, where /s(10000, 1500) is a left-shoulder fuzzy membership 
function (see Section 8.1.1). Any answer x is scored according to the product 
of being cheap and a sports car. 

Given a fuzzy graph G, a query (q(x), A) E 3y3A".p(x, A, y, A’), a vector 
t of terms in universe(G) and a truth degree A in [0,1], we say that (q(t), A) 
is entailed by G, denoted G E (q(t), A), if and only if in any model Z of G, 
there is a vector t’ of terms in universe(G) and a vector A” of truth degrees 
in [0,1] such that Z is a model of y(t, A, t’, ’). If G & (q(t), A) then (t, A) is 
called an answer to q. The answer set of q w.r.t. G is 


ans(G,q) = {(t, A) | GE (a(t), A), A £0 and 
for any X Z A such that G — (q(t), A’), M <A holds} . 


That is, for any tuple t, the truth degree A is as large as possible. 


Disjunctive fuzzy queries with aggregation operators. As we did in 
Section 6.2, concerning crisp queries, we extend the notion of fuzzy conjunctive 
query to include aggregation operators as well. 
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As we have seen, aggregates may be like the usual SQL aggregate functions 
such as SUM, AVG, MAX, MIN. But now, we have also logic specific aggregates 
such as © and 9. 

Formally, let @ be an aggregate function with 


@ e (SUM, AVG, MAX, MIN, COUNT, 9, @} 


then a query with aggregates is of the form 


(q(x), A) <= AIy3IA'.p(x, y, A’), 
GroupedBy(w), (9.2) 
A:=@[f(z)] , 


where w are variables in x or y and each variable in x occurs in w and any 
variable in z occurs in y or A’. 

From a semantics point of view, we say that Z is a model of (satisfies) 
q(t, A), denoted Z } (q(t), A) if and only if 


A = Qf,- Ax] where g = {(t, th, A4),...,(b, A, th, Ap) D 
is a group of k tuples with identical projection 
on the variables in w, y(t, t., A.) is true in Z 


and A, = f(t) where t is the projection of (t/., A.) 
on the variables z . 


Now, the notion of G } (q(t), A) is as usual: any model of G is a model of 
(q(t), A). 

We conclude by defining a disjunctive query q as usual as a finite set of 
conjunctive queries in which all the rules have the same head. Intuitively, the 
answers to a disjunctive query are the union of the answers of the conjunctive 
queries. That is, for a disjunctive query q = {q1, . - - , qm}, G entails q (denoted 
G H q) iff GE q; for some q; € q. 


Top-k Retrieval. As now each answer to a query has a degree of truth 
(i.e., score), the basic inference problem that is of interest is the top-k retrieval 
problem, formulated as follows. 

Given a fuzzy graph G, and a query q, retrieve k answers (t,A) with 
maximal degree and rank them in decreasing order relative to the degree A, 
denoted 


ans,(G,q) = Top; ans(G, q) . 


We point the interested reader to Appendix G for a variant of the SPRQL 
query language for RDFS (see Section 6.3) allowing to query fuzzy RDFS 
graphs. 
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9.4 Reasoning 


In what follows, we provide a sound and complete deductive system (for 
the graph entailment problem) for fuzzy pdf (see [429]). 

The most important feature of fuzzy RDFS is that we are able to provide a 
deductive system in the style of the one for classical RDFS (see Section 3.4). 
Moreover, only the support for the domain dependent ® and © operations 
has to be provided and, thus, are amenable to an easy implementation on top 
of existing systems. The rules in Table 9.1 are arranged in groups that cap- 
ture the semantic conditions of models, A, B,C, X, and Y are meta-variables 
representing elements in UBL and D, E represent elements in UL. The rule 
set contains two rules, (la) and (10), that are the same as for the crisp case, 
while rules (2a) to (5b) are the annotated rules homologous to the crisp ones. 
Finally, rule (6) is specific to the annotated case. 

Please note that rule (6) is destructive i.e., this rule removes the premises 
as the conclusion is inferred. We also assume that a rule is not applied if the 
consequence is of the form (7,0) (see Remark 22). 

The reader may notice also that, except for rule (1a), (1b) and (6), all rules 
apply the so-called graded deduction rule we have seen in Equation (8.44), 
related to mathematical fuzzy logic. 

It can be shown that: 


Proposition 64 (Soundness and completeness [429]). For a fuzzy graph, the 
proof system HF is sound and complete for |=, that is, (1) if GE (T, A) then 
G H (T, A) and (2) if G E (T, A) then there is X > A with GF (T, A. 


We point out that rules 2— 5 can be represented concisely using the following 
inference rule: 


TA A {T1; ae - Tn} FRDES T) 
(T, &; Xi) 


Essentially, this rule says that if a classical RDFS triple 7 can be inferred 
by applying a classical RDFS inference rule to triples ™,...7, (denoted 
[T1,...,Tnj Frors 7), then the degree of truth of 7 will be &, A;, where 
A, is the degree of truth of triple 7;. It follows immediately that, using rule 
(AG), in addition to rules (1) and (6) from the deductive system above, it is 
easy to extend these rules to cover the complete RDFS rule set. 

Finally, like for the classical case, the closure is defined as cl(G) = {(r, A) | 
G +* (7,A)), where F* is as F without rule (la). Note again that the size 
of the closure of G is polynomial in |G| and can be computed in polynomial 
time, provided that the computational complexity of operations ® and © are 
polynomially bounded (from a computational complexity point of view, it is 
as for the classical case, plus the cost of the operations 8 and © in L). 


(40)! 


(9.3) 
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TABLE 9.1: Inference rules for fuzzy pdf. 


. Simple: 


(a) for a map p: @ > G 


Or for G CG 


. Subproperty: 


(A, SP, B), A), ((B,sp, C), 2) 
((A, sp, C), A ® A2) 


(a) 


((D, sp, E), A), (X, D,Y), A2) 
(E, Y), AMO A2) 


(b) 


. Subclass: 


((A, sc, B), A1), ((B, sc, C), A2) 
((A, sc, C), Ar 8 A2) 


(a) 


((A, SC, B), A), (X, type, A), A2) 
(X, type, B), Az S A2) 


(b) 


. Typing: 


((D, dom, B),A1), (X, D, Y), Az) 


DA type, BY, © Az) 


((D, range, B), A), (X, D,Y), A2) 
(Y, type, B), A & A2) 


(b) 


. Implicit Typing: 


((A, dom, B), A1), (D, sp, A), A2), (X, D, Y), Az) 


(4) ((X, type, B), A1 8 Az 8 Az) 


((A, range, B), A1), ((D, sp, A), A2), (X, D,Y ), As) 


b 
6) (ye B), 8208) 


. Generalization: 


((X, A, Y); A1), ((X, A, Y), A) 
(X, A, Y), Az ® A2) 
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Eventually, similar propositions as Propositions 5 and 7 hold. Specifically, 


Proposition 65. Gt H if and only if H C cl(G) or H is obtained from 
cl(G) by applying rule (La). 


Therefore, like the crisp case, a simple method to determine G + 7, where 
both G and 7 is ground, consists in computing cl(G), where the size of the 
closure of G is O(|G|?), and check whether 7 is included in cl(G). 

Now, under the above closure computation, we have, as for the crisp 
case [325]: 
Proposition 66 (Size of Closure). 

1. The size of the closure of G is O(|G|?). 


2. The size of the closure of G is in the worst case no smaller than Q(|G|?). 


We conclude by showing how to compute the answer set of a fuzzy query. 
Essentially, it follows the same procedure as for the crisp case (see Section 3.4). 
In fact, we have that 


Proposition 67. Given a graph G, (t,A) is an answer to q if and only if 
3yJA'.p(t, A, y, A’) is true in the closure of G and A is <-maximal.? 


By relying on Proposition 67, we may devise a similar query answering method 
as for the crisp case: a method to determine ans,(G,q) is as follows. 


1. Compute the closure cl(G) of G and store it into a database that sup- 
ports top-k retrieval (e.g., RankSQL [258]°). 


2. It can easily be verified that any fuzzy query can be mapped into a top-k 
SQL query [258] over the underlying database schema. 


3. Hence, ansk(G, q) is determined by issuing these top-k SQL queries to 
the database. 


Example 30. Consider 
G = {((p, dom, c), 0.5), ((d, sc, c), 0.4), ((b, type, d), 0.3), ((a, p, b), 0.2) } 
under product logic. Then the closure of G is 
cl(G) = GU {((b, type, c), 0.12), ((a, type, c),0.01)} . 
Therefore, for the query 
(a(x), s) e (x, type, c), s) 


we have that 
ansı(G, q) = {((b, type, c), 0.12)} . 


23y3A".p(t, A, y, A’) is true in the closure of G if and only if for some t’, X for all triples 
in y(t, A, t’, A”) there is a triple in cl(G) that subsumes it and the built-in predicates are 
true, where an fuzzy triple (7, A1) subsumes (7, Az) if and only if A2 < Aa. 

3But, e.g., Postgres http://www.postgresql.org/, MonetDB http: //monetdb. cui .n1/ 
may work as well. 
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Chapter 10 


Fuzzy OWL 


10.1 Introduction 


In Chapter 4, we have introduced the Web Ontology Language OWL [338] 
and its successor OWL 2 [101, 340], which together with RDFS (see Chapter 3) 
have become the most popular Semantic Web representation formalisms. We 
have also shown their relationship with the Description Logics family of logics. 

The basic ingredients are the descriptions of classes, properties, and their 
instances, such as 


e a:C, such as a:Person M VhasChild.Femal, meaning that individual a is an 
instance of concept/class C (here C is seen as a unary predicate); 


e (a,b):R, such as (tom, mary):hasChild, meaning that the pair of individ- 
uals (a,b) is an instance of the property/role R (here R is seen as a 
binary predicate); 


e CLD), such as Person E VhasChild.Person, meaning that the class C is 
a subclass of class D; 


e REP, such as isContainedin E isPartOf, meaning that the property R 


is a subproperty of the property P. 


However, under the classical semantics, OWL cannot represent vague infor- 
mation, such as “tom likes apples to some grade,” and, to this purpose, several 
fuzzy variants have been proposed: they can be classified according to 


e the description logic resp. ontology language that they generalize [42, 
49, 51, 52, 55, 138, 285, 286, 288, 287, 289, 291, 292, 367, 368, 369, 383, 
391, 398, 399, 407, 415, 454, 471]; 


e the allowed fuzzy constructs [48, 220, 223, 222, 218, 219, 221, 217, 194, 
195, 227, 299, 419]; 


e the underlying fuzzy logic [47, 45, 53, 188, 189, 397, 411, 405]; 
e applications [4, 114, 54, 262, 320, 386, 392, 393, 401, 406, 434, 435, 418, 
430, 432, 431, 474]; 
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e their reasoning algorithms and computational complexity results [40, 50, 
41, 43, 27, 28, 64, 70, 72, 71, 68, 74, 69, 73, 75, 91, 92, 469, 342, 384, 
390, 389, 395, 396, 404, 410, 425, 426, 412, 475]. 


We also refer the reader to [290, 416, 422] for a survey. 

As DLs [19] (see Chapter 4) are the logical counterpart of the family of 
OWL languages, to illustrate the basic concepts of fuzzy OWL, it suffices to 
show the fuzzy DL case. Briefly, one starts from a classical DL, and attaches 
to the basic statements a degree n € [0, 1], similarly as we did for fuzzy RDFS 
in Chapter 9. 

Our main goal of this part is to provide a very general semantics and 
deductive systems similarly as we did in Chapter 4. 


10.2 Fuzzy Description Logics Basics 


In general, fuzzy DLs allow expressions of the form (a:C, n), stating that a 
is an instance of concept C with degree at least n, i.e., the FOL formula C(a) is 
true to degree at least n. Similarly, (Cy E C2,n) and (Ry E Ra,n) state vague 
subsumption relationships. Informally, (C1 E C2,n) dictates that the FOL 
formula Vx.Cy (1) — Ca(x) is always true to degree at least n. Essentially, 
Fuzzy DLs are then obtained by interpreting the statements as fuzzy FOL 
formulae and attaching a weight n to DL statements, thus, defining so fuzzy 
DL statements. 


10.2.1 Syntax and Semantics 


Let us consider the DL SROTQ without concrete domains (see Sec- 
tion 4.2.2.3 and Table 4.1). Consider n € [0,1], and the crisp SROZO ax- 
ioms (A1) - (A17) in Table 4.1. Then the following are fuzzy axioms of fuzzy 
SROTQ: 


e fuzzy GCIs of the form (C E D,n); 


e fuzzy concept assertions of the form (a:C,n); 


e fuzzy role assertions of the form ((a,b):R,n) and ((a, b):R,n); 


e fuzzy RIAs of the form (R,...R, E R,n); 
e axioms (A11), (A12), (A14)-(A17), i.e., trans(R), dis(S1, S2), ref(R), 
irr(S), sym(R) and asy(S). 


We will write fuzzy axioms also as (a, n), where a is a GCI a RIA or an 
assertion. We will use interchangeably, if clear from context, a in place of 
(a, 1). We will say in this case that a is a classical axiom. 
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We make one change only to the concept constructs, by allowing graded 
singleton concept expressions of the form {n/a}, where n is a degree of truth, 
in place of the crisp variant {a}. 

It is worth noting that one may find in fuzzy DLs also fuzzy statements 
of the form (a> n}, (a<n), (a>n), (a<n), and (a =n), stating that the 
degree of truth of axiom a is bounded by en, where e € {>, <, >, <,=}. 
We stick here to the form (a,n), i.e., («> n), only (i) for uniformity among 
the various fuzzy Semantic Web Languages presented in the book; and (ii) as 
(a, n) is the most used one. Besides, by reminding that graded statements are 
intended to be produced semi- or automatically, it is hardly conceivable that 
they may have, e.g., the form (a <n), (a > n) or (a < n). However, sometimes 
we will still use, if needed, expressions of the form (a e n). 

A fuzzy knowledge base is now, similarly as for crisp SROTQ, a triple K = 
(T, A, R), where now fuzzy axioms occur in place of classical axioms. T, A 
and R are called now fuzzy TBox, fuzzy ABox, and fuzzy RBox, respectively. 


Example 31. Consider the following background knowledge about cars en- 
coded as the fuzzy TBoz T*: 


Car E JHasPrice.Price 
Sedan E Car 
Van E Car 
CheapPrice E Price 
ModeratePrice E Price 
ExpensivePrice E Price 
(CheapPrice E  ModeratePrice, 0.7) 
(ModeratePrice E ExpensivePrice, 0.4) 
CheapCar = CarMJ3HasPrice.CheapPrice 
ModerateCar = Carl dHasPrice.ModeratePrice 
ExpensiveCar = Carl dHasPrice.ExpensivePrice 
fun(HasPrice) 


Essentially, the vague concepts here are CheapPrice, ModeratePrice, and 
ExpensivePrice and the graded GCIs declare to which extent there is a rela- 
tion ship among them. 

The facts about two specific cars a and b are encoded with the following 
fuzzy ABoz A: 


(a:Sedan N JHasPrice.CheapPrice, 0.7) 
(b:VanN AHasPrice.ModeratePrice,0.8) . 


So, a is a sedan having a cheap price, while b is a van with a moderate price. 

The semantics is a natural extension of the crisp one for DLs presented in 
Section 4, in which the interpretation of concepts and roles become fuzzy sets 
instead of crisp sets. Specifically, we consider the first order reading of the 


¡For the sake of clarity, here fun(R) is a macro for T E (< 1 R). 
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fuzzy DL constructs as from Section 4.2.2.3, specifically Table 4.1, and give 
them a fuzzy FOL semantics as illustrated in Section 8.2. 

For our purposes, let us consider truth combination functions 9, $, => and 
© defined on a truth space L as from Chapter 8. Here, L may be [0, 1], [0, Jo 
or Ln (so, e.g., ® doesn't necessarily have to be a t-norm, see Section 8.1.7). 

Then, a fuzzy interpretation T = (A7, 2) consists of a nonempty set A? 
(the domain) and of a fuzzy interpretation function Y that assigns 


e to each atomic concept A a function 47: A? > L; 
e to each abstract role R a function R?: A? x A? > L; 


e to each individual a an element a? € AZ such that a? 4 b? if a Æ b 
(UNA). 


C7 denotes the membership function of the fuzzy concept C with respect to 
the fuzzy interpretation Z. For x € AZ O7(z) gives us the degree of being z an 
element of the fuzzy concept C under Z. Similarly, R? denotes the membership 
function of the fuzzy role R with respect to Z. For x,y € AZ, R? (x,y) gives 
us the degree of being (x,y) an element of the fuzzy role R. 

Interpretations are extended to SROZQ constructs as illustrated in 
Table 10.1 (in Table 10.1, e.g., the condition (C3) has to be read as: 
(COD) (a) =C7(2) 8 D (2). 

We say that a fuzzy interpretation Z satisfies (is a model of) a fuzzy 
KB K = (T, A,R) iff it satisfies each element in A, T and R. A fuzzy KB 
K =(T,A,R) entails an axiom E, denoted K |= E, iff every model of K 
satisfies E. We say that two concepts C and D are equivalent, denoted C =x D 
iff in any model Z of K and for all x € AF, C7(x) = D7(z). 

As for the fuzzy FOL case (see Section 8.2.1), for concept assertion, role 
assertion GCI or role inclusion axiom ¢, we say that (¢,n) is a tight logical 
consequence of K iff n is the infimum of $7 subject to all models Z of K. 
Notice that the latter is equivalent to n= sup {r |K E (¢,r)}. n is called the 
best entailment degree of y w.r.t. K (denoted bed(K, @)), i.e., 


bed(K, e) = sup{r| KE (¢,r)} . (10.1) 


On the other hand, the best satisfiability degree of 6 w.r.t. K (denoted 
bsd(K, $)) is 


bsd(K, $) = sup {47 |Z EK}. (10.2) 
ob 


For a concept C, we also say that the best satisfiability degree of C w.r.t. K 
(denoted bsd(K, C)) is 


bsd(K,C) = sup sup C7 (x). 
TIEK xe At 
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TABLE 10.1: Syntax and semantics of the fuzzy DL SROZO. 


Concepts Syntax (C) Semantics: truth value of CZ (x) 
(Cl) A AZ (2) 
(C2) T TZ (0) =1 
(03) L 1Z(x)=0 
(C4) cnp CI (a) 8 DÍ (zx) 
(C5) CuD Ct (x) 8 DÍ (x) 
(C6) ac 807 (x) 
(C7) VR.C int caz RT (e, y) = CT (y)} 
(C8) 3R.C sup, eaZ {RT y) 9 C7 (y)} 
(C11) {n/a} n if z = a7, 0 otherwise 
(012) (> m S.C) sup ym eaZ OTE (57 (0, vi) 8 CT (vs) OOj cken tus A ve 
(C13) (< m S.C) AZ 
KOT (87 (e, wi) 0 CF (40) > (Oj<r<ny11uj = yr Dl 
(C16) 3S.Self st (o, x) 
Roles Syntax (R) Semantics of R(z, y) 
(RI) R RT (x,y) 
(R2) Re RT (y, x) 
(R3) U 1 
Other Syntax (X) Semantics: truth value of XZ 
(X1) a:C CZ (az) 
(X2) (a, b):R RT (az, pt) 
(X3) (a, b):R ORT (at, bt) 
(x6) CED inf „caz C7 (2) > DT (2) 
(x7) Ri. Ra OR eer a EÁ? ORT (21,29)... RẸ (æn; 2741) > R? (21; en 41) 
Axiom Syntax (E) Semantics (Z satisfies E if...) 
(Al) (a:C, n) cía) > n 
(42) ((a, b):R, n) RE (at, bt) > n 
(A3) ((a, b):R, n) ORT (at, bt) > n 
(A6) (CE D,n) inf CAT CT (a) > DT (xz) >n 
(A7) Br Rey ERA) e ap ea QIRȚ (21,292), -.-RE(2n, 2941) 
> RT (z1, en) > n 
(A9) trans( R) Vz, y € AT, RŽ (x, y) > sup caT R? (2,2) 0 RI (z, y) 
(A10) disi(S1, S2) Yz, y € AT, ST (æ, y) =0 or Sí (2, y) =0 
(A12) ref(R) va € AZ, RT (z,z)=1 
(A13) irr( S) va € AZ, ST (x, x) = 0, 
(A14) sym(R) Va, y € AT, RT (z, y) = RT (y, 20) 
(A15) asy(S) Va, y € AZ, if sZT(x, y) > 0 then SZ(y,2)=0 . 


Example 32. Consider Example 31 under Godel semantics. Then it can be 
shown that 


K | (a:ModerateCar, 0.7) 
K (d:ExpensiveCar, 0.4) . 


Informally, in the former case the reasoning is as follows. As a is a sedan (at 
least to degree 0.7), it is a car (at least to degree 0.7) and, thus, a is a car with 
a cheap price (at least to degree 0.7). Therefore, by the definition of a cheap 
car, a is, thus, a cheap car (at least to degree 0.7). In the latter case, as b is 
a van (at least to degree 0.8), it is a car (at least to degree 0.8) and, thus, b is 
a car with a moderate price (at least to degree 0.8). Therefore, as a moderate 
price is to some degree an expensive price, b has, thus, an expensive price (at 
least to degree min(0.8, 0.4) = 0.4). Eventually, by the definition of expensive 
car, b is, thus, an expensive car (at least to degree 0.4). 
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Remark 23. Like for the fuzzy FOL case, for which Y and 3 are not comple- 
mentary in general (see Remark 19), also for fuzzy DLs we have that VR.C 
and =3R.C are not, unlike the classical case, equivalent. However, they are 
equivalent under Lukasiewicz logic and SFL. 


Remark 24 (On Number Restrictions). In [49] it has been pointed out that, 
unlike the classical case (see Section 4.2.2.1), in the fuzzy case we do not have 
in general that (< n R.C) =-(> n+1 R.C) holds. 

For instance, in Eukasiewicz logic assume the following interpretation: 


((tom, apple): likes)? = ((tom, banana): likes)? 


((tom, orange): likes)? 
= ((tom,peach): likes)? = 0.5 


(apple:Fruit)’ = (banana:Fruit)* 
= (orange: Fruit)? 


= (peach:Fruit)” =1, 


where apple”, banana? , orange? , peach? are different. 

Then, (< 1 likes.FruitY (tom) = 1. In this example, while one may expect 
tom not liking more than one fruit, he likes many more fruits x; as long as 
they satisfy ((tom, xi): likes)? + (zi:Fruit) < 1. 

According to [49] the semantics of cardinality restrictions should satisfy 
the following properties instead: 


e If (<n R.C) (a) =1 then |{b | (R(a,d) 9 CM) > 0}| <n. 
e IR.C = (>1 RC). 


e (<n R.C)=-(>n+1 R.C). 


Therefore, [49] proposes the following semantics for number restrictions (using 
the equivalence Vx.p = 7Ax.¢, see Remark 19 and Remark 21): 


En RC) (x) = supy.. uneaz ming {R7 (x, y:i) 8 Cy) @(@j<n<n {yj A yeh] 


| 
= 
B 


(< n R.C) (2) Vista om reat ninti (R7 (2, yi) 8 CT (yi)} > (Oj<r<n+11Yj = 9) 


Note that, equivalences among fuzzy concepts are similar as for the fuzzy 


propositional case (see Table 8.6) as illustrated in Table 10.2. 


10.2.2 Some Additional Constructs 


It is also useful to introduce the following constructs: 


e C > D with semantics (C > DY (x) = CF (x) > DT (x); 
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TABLE 10.2: Some additional fuzzy concept equivalences. 


Property Lukasiewicz Godel Product | SFL 

Cnac=L e ° e 

CU-C=T ° 

cnc=C ° . 
CUC=C e e 
==C=C ° ° 
C>D=>= D ° , 
C>D==D>-C . e 
a(C > D)=C =D o e 
a(CND)=-7~CUuUAD ° ° . ° 
=(CUD)==CN=D ° ° o e 
CN(DUE)=(CND)U(CNE) e e 
CU(DNE)=(CUD)N(CUE e e 
IRC = -VR.AC ° . 


eCoDto(C>D)n(D>C); 


min{C, Dj for C N (C > D), and miníC;,..., Cn} for 
min{...min{C1, Co},...}; 


e max{C,D} for min((C > D) > D,(D > C) > C) and 
max{C},...,C,} for max{...max{C,C},...}; 


e for x € {g,l,p,s}, CN, D, CU, D, C >, D, indicating that the 
concept operators Mg, Uy and +, are interpreted according the logic 
x € {g,l,p,s}, where g,l,p, and s stand for Gödel logic, Lukasiewicz 
logic, product logic and SFL, respectively; 


e Cı = Ca for the two axioms Ci E Co and Cp E Ci; 


e (C1 =C2,n) for the axiom (T E (C1 + C2), n); 


e dom(R, C), called domain restriction axiom, for IR.T E C; 


e ran(R,C), called range restriction axiom, for T E VR.O; 
e (=n RC) for (> n R.C) N (< n R.C); 
e (=n R) for (=n R.T). 


Remark 25. It is worth noting that, w.l.o.g., an axiom (C E D,n) may be 
rewritten as (T E C > D,n), an axiom Cı = Co may be rewritten as T E 
Ci — Ca. 
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Remark 26 (Fuzzy DLs under SFL). /395/, which presents fuzzy ALC under 
SFL, proposes a slightly different semantics for fuzzy GCls. In fact, in [395] 
a fuzzy GCI is of the form C E D with semantics: T is a model of C E D 
iff for any x € AZ we have that C?(x) < DY(x). This is the same of any 
fuzzy axiom of the form (T EC >, D,1), where >, is an r-implication. To 
be compliant with [395] and most fuzzy DLs work developed later on under 
SFL, we will use expressions 


e CLD with semantics: T is a model CED iff for any x € AZ we have 
that C7 (x) < D*(x); 


e A=C with semantics: T is a model AC iff for any x € A? we have 
that At (x) = CF (x); 


e RCS with semantics: T is a model RCS iff for any x,y € A? we have 
that R (x,y) < S*(a,y). 


10.2.3 Acyclic Fuzzy Ontologies 


As for the crisp case (see Section 4.4.1.2), acyclic fuzzy ontologies play an 
important role in fuzzy DLs both as they occur often in practices as well as 
from a computational complexity point of view. 

Specifically, let us also introduce a restricted form of TBoxes, i.e., acyclic 
TBoxes. That is, let 7 be a Tbox in which the GCIs have one of the following 
form 


3 


Bh BB 
IM: IN 
YQ a 


3 


A = C, 
where A is a concept name, C is a concept, and A Ep C is a shorthand for 
(T E A> C,n) and A=, C is a shorthand for (T E AO C,n). We call the 
former two GCIs primitive and call the latter two definitional. 
We say that A is the head of these axioms and C is the body. Furthermore, 
we also assume that no concept name A is in the head of more than one axiom. 
Now, we say that 


e concept name A directly uses concept name B w.r.t. T, denoted A >y 
B, if A is the head of some axiom 7 € T such that B occurs in the body 
of 7; 


e concept name A uses concept name B w.r.t. T, denoted A ~y B, if 
there exist concept names A4,..., An, such that A = A, A, = B and, 
for every 1 <i < n, it holds that A; >r Aj41. 


Eventually, we say that a TBox T is cyclic (acyclic) if there is (no) A such 
that A ~r A. We say also that a fuzzy TBox is unfoldable if it is an acyclic 
TBox which only contains inclusion axioms of the form AEC and ASC. 
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10.2.4 On Witnessed Models 


By relying on Table 10.1, in a similar way as for fuzzy FOL (see Sec- 
tion 8.2.1.1), the use of infima (universal quantification Y) and suprema (ex- 
istential quantification 3) may lead to counterintuitive behaviors (see also, 
e.g., [186, 188, 189, 190]). For instance, consider the concept assertion 


(a:3R.A, 1) . 


Consider the interpretation Z with domain N, a? = 1, and for all n,m € N 


AZ (m) = pai 


n 


However, 


= 1 
(AR.A) (a?) = sup R7 (a7, n) @ AF (n) = sup AF (n) = sup1—— =1. 
nen nen nen n 


That is, unlike the crisp case, notwithstanding there is no individual n of the 
domain of Z satisfying 


R? (a,n) 9 A*(n) =1, 


still, Z satisfies the assertion (a:3R.A,1), which is the fuzzy DL analogue of 
Equation (8.47). Similar arguments apply to any of the expressions in Ta- 
ble 10.1 involving infima and suprema?. 

Additionally, in a similar way as for Equation (8.48), under Gödel logic we 
may build a fuzzy assertion, e.g., 


(a:VR.AM=3R.54, 1) 


that has no classical model, but it has a fuzzy one, which has to be infinite 
and non-witnessed. 

While such interpretations may exist in theory, we believe that they un- 
likely may model any practical knowledge representation and reasoning do- 
main. Therefore, we will restrict out attention to witnessed models only in the 
sense of Section 8.2.1.1. 

Specifically, a fuzzy interpretation Z is witnessed iff the supremum of every 
expression in Table 10.1 coincides with the minimum and the infimum of every 
expression coincides with the maximum. 


2That is, expressions of the form (C7), (C8), (C12), (C13), (X6), (X7), (46), (A7), (A9). 
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Note that it is obvious that all finite fuzzy interpretations (this means that 
A? is a finite set) are indeed witnessed but the opposite is not true. Note also 
that still Proposition 39 applies. 

Eventually, to what concern us here, we recall from [39] the following 
property. 


Proposition 68 ([39] Theorem 4.4.). In Lukasiewicz logic, thus, in SFL, an 
acyclic knowledge base K is satisfiable iff K has a finite model. 


Proposition 68 is not true if we drop the acyclicity condition, i.e., any GCI 
may occur in the TBox (see [39], Theorem 3.3). In fact, the following examples 
show that if we allow arbitrary knowledge bases, then the fuzzy DL ALC 
under Lukasiewicz and Product fuzzy logics do not verify the Finite Model 
Property (FPM) even if we restrict to witnessed models; in other words, finite 
satisfiability and witnessed satisfiability are different for arbitrary knowledge 
bases. 


Remark 27 ([39]). K is the fuzzy KB with the following axioms 


1) (a:A,0.5) 


2) 7 JR.T 


3) (VR.A) = (3R.A) 


(1) 
(2) 
(3) 
(4) A= (VR.A)N(VR.A) 


Now, let T be a witnessed model of K under Łukasiewicz fuzzy logic. Then, 
it can be shown [39] that for every natural number n there are individuals 
b1, b2, bs, Sor „ba such that 0.5 = AF (bı) < AZ (by) < AT (b3) LERNER A“ (bn) K 
1. Therefore, there is no finite model for K under Lukasiewicz fuzzy logic. 
Moreover, K is, under Lukasiewicz fuzzy logic, satisfiable by a witnessed model 
but not by a finite model. In fact, one witnessed model of K is the model T 
defined by 


e A7 = {1,2,3,... }U {00}, 
e RI is the crisp relation {(i,i +1): i =1,2,3,...} U {(c0, 00)), 
e A*(co) =1 and AF (i) = (2 — 1)/2* for every i = 1,2,3,... 


T=1. 


Similar results hold for K under product logic: K is, under product fuzzy logic, 
satisfiable by a witnessed model but not by a finite model. In fact, one witnessed 
model of this fuzzy KB is the model T defined by 


e At = {1,2,3,...}, 


e R is the crisp relation {(i,i+1):i=1,2,3,...}, 
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e A*(i) = ae (3) for every i = 1,2,3,..., 


Remark 28 ([39]). X’ is the fuzzy KB with the following axioms 
(1) (a:A, 0.5) 
2 


| JR.T 


) 
) 
3) (VR.A) = (3R.A) 
) 
) 


4) A=(VR.A) N (VR.A) 
(T E (54),0.1) 


( 
( 
( 
(5 


Note that K' is obtained by adding axiom (5) to K in Example 27. In [39] it 
has been shown that K' is unsatisfiable both under Lukasiewicz fuzzy logic as 
well as under product logic. 


We conclude by recalling the following interesting property. 


Remark 29. Let us point out that w.r.t. witnessed models, similarly to the 
classical case (see Section 4.4.1.2), if A is a concept name and 8 is continuous, 
and = is the related r-implication, then an axiom (A E D,n) may be rewritten 
as (A=CNM4',n), where A’ is a new concept name. 

Similarly, is it easily verified that any inclusion aziom ALC may be rewrit- 
ten as AC M A’, where A’ is a new concept name. 


Remark 30. Under the conditions of Remark 29, we may assume that GCIs 
in an acyclic TBox have one of the forms 


10.3 Salient Language Extensions 


As we did for fuzzy FOL (see Section 8.2.3), next we describe some salient 
extension to fuzzy DLs, which make fuzzy DLs particularly attractive from 
an application point of view. 


Fuzzy Concrete Domains. To start with, we discuss how one may provide 
concrete domains within fuzzy DLs. To do so, we rely on [398]. In general, 
similarly to Section 8.2.3 (see also Section 4.2.2.2), a fuzzy concrete domain, 
also called a fuzzy datatype theory D= (AP, -P) consists of a datatype do- 
main AP and a mapping -P that assigns to each data value an element of AP, 
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and assigns to every n-ary datatype predicate d an n-ary fuzzy relation over 
Ap. More specifically, fuzzy DLs do support unary datatypes only. Therefore, 
-P maps indeed each datatype predicate into a function from AP to [0,1]. 

Typical examples of datatype predicates d are the well known fuzzy mem- 
bership functions 


d := ls(a,b) | rs(a,b) | tri(a, b,c) | trz(a, b,c, d) 
and the crisp membership functions 


d := Zo ls 


where, e.g., ls(a,b) is the left-shoulder fuzzy membership function, while, 
e.g., >, corresponds to the crisp set of data values that are greater than 
equal to the value v. 

Concerning roles, a role R is either an object property or a datatype prop- 
erty. An interpretation maps an object property into a function A? x A? > 
[0,1], while maps a datatype property into a function A? x AP > {0,1}. A 
datatype property does not have an inverse, but may be functional. 

We also use an alphabet for concrete individuals, denoted v, and extend 
an interpretation to concrete individuals by mapping them into AP. As for 
individuals, we adopt the UNA, i.e., 012 % v27 if v1 F və. 

We can now extend concept expressions according to the following syntax: 


C.D => WTd|3Td, 


where d is a datatype and T is a datatype property. 
For instance, the expression 


Human MhasAge >is , 


where the datatype property has Age has been declared functional, will denote 
the crisp set of humans, which have an age less or equal than 18, while 


Human N Shas Age.ls(10, 30) 


will denote the fuzzy set of young humans (their age is 1s(10, 30)). 


Example 33. According to the fuzzy wine ontology*, we have the following 
inclusion 


SparklingWineN dhasSugar.ExtraDrySugarContentF or SparklingW ine 
E ExtraDrySparklingW ine 


where hasSugar is a functional role and 


ExtraDrySugarContentF or SparklingW ine 


3See http://nmis.isti.cnr.it/~straccia/software/FuzzyOWL/. 
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TABLE 10.3: From fuzzy SROTQ to fuzzy SROTQ(D). 


Concepts Syntax (C) Semantics: truth value of CT (æ) 
(G9) VT.d inf D TT (æ, v) > aP Œ) 


ved E D 

(C10) 3T.d sup caD 7*(2,v) 8d ©) e 
(C14) (2 n T.d) SUP aia vn e AD Op (TA (2, v,) 8 d7 (0,)) 9 9d1<j<k<n rj Á Vk 

5 : n+l (pT : Diy, . = 
(C15) (< m T.d) tn aie AD Oa (T7 (z, v4) @ d- (v4)) > Di<j<k<n+1 Vj = Vk 
Roles Syntax (R) Semantics of R(x, y) 
(X4) (a, v): T TL (at, vT) 
(X5) (a, v): >T ori (at, vt) 
Axiom Syntax (E) Semantics (Z satisfies E if...) 
(44) (a,v): T, n) Tfal, vt) > n 
(A5) (a, v): =T OTZ (af, vt) > n 
(A8) Tı E T2 inf CAZ veaD TZ (z, v) > TZ (a, v) 
(A11) dis(Tj , T2) ve € AŤ, v e AP, TĒ (x, v) = 0 or TZ (æ, v) = 0 


is a fuzzy concrete domain whose definition is 
tri(12, 16,20) 


in which the values represent the amount of sugar (grams per litre) a sparkling 
wine should have to be an extra dry sparkling wine. Informally, the closer the 
amount of sugar is around 12g/l or 209/l, the less the sparkling wine is an 
extra dry one. Optimal is a sugar level of 169/[. 

We refer the reader to the fuzzy wine ontology for many more definitions 
of this type. 


More generally, fuzzy SROZO(D) is obtained from fuzzy SROZO (see Ta- 
ble 10.1) by adding the constructs as illustrated in Table 10.3. 


Modifiers. Fuzzy modifiers (see Section 8.1.8) such as very and slightly, 
apply to fuzzy concepts to change their membership function. We re- 
call from Section 8.1.8 that a fuzzy modifier m represents a function 
(e.9., fuery(2) =lm(c)(x), see Figure 8.8) 


fim: [0,1] > [0,1] . 


Now, we extend the language of fuzzy concept constructors by allowing to 
apply a modifier m to a concept C or a concrete domain predicate d: i.e., 


C > m(C) | VP.m(d) | 3T.m(d) 


allowing, e.g., to express the concept 


Human N Shas Age. very(1s(10, 30)) 


denoting the fuzzy set of very young humans (their age is very(1s(10, 30)) (see 
also [48, 127, 194, 195, 196, 197, 198, 444]). 
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From a semantics point of view, we extend fuzzy interpretations in the 
obvious way 


m(C)J (2) = fm(07(0)) 
mid) (e) = fm(d?(@)) . 


Aggregation Operators. Eventually, as for the fuzzy propositional case (see 
Section 8.2.3), we may extend fuzzy DLs by allowing aggregation operators 
(see Section 8.1.6) to aggregate concepts, as illustrated, e.g., in [58, 59]. So, 
let @ be an n-ary aggregation operator as defined in Section 8.1.6, then we 
extend the language of fuzzy concepts by allowing to apply Q to n concepts 
Ca, TET Ch, 1.€., 

C > @C,...,Cn) 


allowing, e.g., to express the concept 
0.7 - ExpensiveHotel + 0.3 - LuxuriousH otel 


denoting the fuzzy set of expensive and luxurious hotels, whose membership 
function is the weighted sum of being an expensive and luxurious hotel. 

From a semantics point of view, we extend fuzzy interpretations in the 
obvious way 


MC... Cut a) = QCÍ(a),... CH (a). 


We conclude this section by illustrating how we analogously may encode Ex- 
ample 27 in fuzzy DLs. 


Example 34 ([48, 359]). Consider Example 27 concerning the matching about 
a car seller and a buyer. 
To start with, the background knowledge can be encoded as: 


Sedan E PassengerCar 

Satellite Alarm E AlarmSystem 

Navigator Pack = (Satellite Alarm N GPS_system) 
Insurance Plus = (Driver Insurance N TheftInsurance) 
TE ExColor Black U ExColorGray 

ExColor Black N ExColorGray EL 
fun(HasAlarmSystem) 

fun( Has AirConditioning) 

fun(HasExColor) 

fun( Has Navigator) 

fun(HasMWarranty) 

fun(HasPrice) 

fun(HasK MW arranty) 


The buyer's preferences can be encoded as follows: 
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1. B = (PassengerCar N JHasPrice. <26000) 
2. By = (AHasAlarmSystem.AlarmSystem > 3HasPrice.ls(22300, 22750)) 


3. B2 = (3HasInsurance.DriverInsurance3HasInsurance.(TheftInsurancel 
FireInsurance)) 


4. Ba = (AHas AirConditioning. Airconditioning aHasExColor.(ExColor Black 
ExColorGray)) 


5. Ba = (3HasPrice.ls(22000, 24000)) 
6. Bs = (3HasK MWarranty.rs(15000, 175000)) 
7. Buy = (BN(0.1- Bı +0.2- B2 +0.1- Bs +0.2- Ba +0.4- Bs)) 


In a similar way, the seller’s preferences can be encoded as follows: 


1. S = (Sedan N 3HasPrice. >22000) 
„ Sı = ((3Has Navigator. Navigator Pack) > (3HasPrice.rs(22500, 22750) ))) 


2 

3 = (3HasInsurance.InsurancePlus) 

4. S3 = (3 HasK MWarranty.ls(100000, 125000) ) 
5 ( 

6 

7 


3HasM Warranty.ls(60,'72)) 
. S5 = ((AHasExColor.ExColor Black) — (3 Has AirConditioning. AirConditioning)) 
. Sell = (SN (0.3 - Sı +0.1-S2 +0.3- S3 + 0.1- S4 +0.2- 55) 


Now, as for Example 27, the best agreement among the buyer and the seller is 
determined by the mazimal degree of satisfiability of the concept conjunction 
Buy NM Sell, i.e., 

bsd(K, Buy N Sell) , 


for which we have the analogous results* 


bsd(K, Buy Sell) = 0.7 
HasPrice = 22000.0 
HasK MWarranty = 175000.0 
HasMWarranty = 0.0. 


It is easy to see that Example 28 can be encoded in fuzzy DLs in a similar 
way (see [48]). 


10.4 Fuzzy Conjunctive Queries 


Concerning query answering, we extend naturally the notion of fuzzy con- 
junctive queries defined in Section 9.3 for fuzzy RDFS (see Chapter 9) to the 
fuzzy DLs case. 


“The values of a data property are the values of the data property corresponding to the 
individual that maximizes the degree of truth of the concept Buy NM Sell. 
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Informally, fuzzy DL queries are as for fuzzy RDFS where fuzzy triples are 
replaced with fuzzy FOL assertions in which fuzzy variables (taken from an ap- 
propriate alphabet and denoted A) may occur and where the fuzzy predicates 
are either unary (for atoms) or binary (for roles). 

To start with, a fuzzy query is of the form 


(q(x), A) E IyJA' .p(x, A, y, A’) (10.3) 


in which y(x,A,y,A’) is a conjunction (as for the crisp case, we use “,” 
as conjunction symbol) of fuzzy unary or binary fuzzy atoms and built-in 
predicates, x and A are the distinguished variables, y and A” are the vectors 
of non-distinguished variables (existential quantified variables), and x, A, y 
and A’ are pairwise disjoint. Variable A and variables in A’ can only appear 
in place of degrees of truth or built-in predicates. The query head contains at 
least one variable. 
For instance, the query 


(q(x), 8) — (SportsCar(x), 81), hasPrice(x, y), s:=81 - 1s(10000, 1500) (a) 


has intended meaning to retrieve all cheap sports cars. Any answer x is scored 
according to the product of being cheap and a sports car. 

From a semantics point of view, we rely on the one for fuzzy RDFS and 
integrate one of the classical DLs, described in Section 4.3. So, given a fuzzy 
DL KB K, a query (q(x), A) € Sy5A’.y(x, A, y, A’), a vector t of individuals 
occurring in K and a truth degree A in [0,1], we say that (q(t), A) is entailed 
by K, denoted K = (q(t), A), if and only if there is a vector t’ of individuals 
occurring K and a vector A of truth degrees in [0, 1] such that for any model Z 
of K, T is a model of all fuzzy atoms occurring in y(t, A, t’, 1). If K E (q(t), A) 
then (t, A) is called an answer to q. The answer set of q w.r.t. K is 


ans(K, q) = (6,4) | X£ E (g(t), A), A #0 and 
for any AX Z A such that KE (q(t), A’), XY <A holds} . 


That is, for any tuple t, the truth degree A is as large as possible. 
Please note that, similarly to Remark 6 we have: 


Remark 31. There is a subtle difference with the usual definition of answer, 
stated as follows: 


Given a knowledge base K, a query (q(x), A) E SySA’.y(x, A, y, A’), 
and a vector t of individuals occurring in K and a truth degree A in 
[0, 1], we say that (q(t), A) is entailed by K, denoted K [E (q(t), A), 
if and only if for any model T of K, T is a model of the FOL for- 
mula Jy3A/” y(t, y), according to the standard definitions of first- 
order logic. We denote the answer set according to this definition 
as ansPOL(K, q). 
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The following example shows the difference between our definition of answer 
set and the usual one according to Remark 31. So, consider the simple knowl- 
edge base 


K = {((a,c):R,0.7), (b:4R, 0.8)} 


and the conjunctive query 


(a(z), 8) E (R(@, y), 8) - 


Then it is easily verified that 


As for the crisp DL case, the difference is due to the fact that in our case the 
instantiation of the non-distinguished variables has to be known independently 
from any model of K, while according to Remark 31, for any model Z of K, the 
non-distinguished variables y, A” may be substituted with domain elements 
depending on Z. Of course, 


ans(K, q) C ans*°*(K, q) 


holds. As for the crisp DL case, in general, our definition here is amenable to a 
more efficient implementation, as in fact, determining whether a € ans(K, q) 
inherits the computational complexity of the complexity of entailment in the 
underlying fuzzy DL. Indeed, a simple procedure to determine the answer set 
ans(K, q) consists in computing off-line the instances of all atomics concepts 
and roles occurring in K, store them into a relational database DBx, convert 
q into and SQL query qsoz and submit gsgr. 


Disjunctive fuzzy queries with aggregation operators. As for the fuzzy 
RDFS case, we may extend conjunctive queries to disjunctive queries and to 
queries including aggregation operators as well. 

Formally, let @ be an aggregate function with 


Q e (SUM, AVG, MAX, MIN, COUNT, 6, 9) 
then a query with aggregates is of the form 


(q(x), A) < Iy3A'.p(x, y, A’), 
GroupedBy(w), (10.4) 
A:=@[f(z)] , 


where w are variables in x or y and each variable in x occurs in w and any 
variable in z occurs in y or A’. 
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From a semantics point of view, we say that Z is a model of (satisfies) 
(q(t), A), denoted Z H (q(t), A) if and only if 


A=QlA1,..., An] where g = {(t, th, 41), ---, (t, A, th, Ag) Jo 
is a group of k tuples with identical projection 
on the variables in w, y(t, t/,, XA.) is true in Z 


and A, = f(t) where t is the projection of (t/., A1) 
on the variables z . 


Now, the notion of K — (q(t), A) is as usual: any model of K is a model of 
(q(t), A). 

We conclude by defining a disjunctive query q as usual as a finite set of 
conjunctive queries in which all the rules have the same head. Intuitively, the 
answers to a disjunctive query are the union of the answers of the conjunctive 
queries. That is, for a disjunctive query q = {q1,---,@m}, K entails q (denoted 
K E q) iff K = qi for some q; € q. 

The notion of answer and answer set of a disjunctive query is a straight- 
forward extension of the ones for conjunctive queries. 


Top-k Retrieval. As now each answer to a query has a degree of truth 
(i.e., score), a basic inference problem that is of interest is the top-k retrieval 
problem, formulated as follows. 

Given a fuzzy KB K, and a query q, retrieve k answers (t, A) with maximal 
degree and rank them in decreasing order relative to the degree A, denoted 


ansy(K, q) = Top, ans(K, q) . 


10.5 Representing Fuzzy OWL Ontologies in OWL 


As pointed out in Section 4, OWL [338] and its successor OWL 2 [101, 340] 
are standard W3C languages for defining and instantiating Web ontologies 
whose logical counterpart are classical DLs. So far, several fuzzy extensions 
of DLs exists and some fuzzy DL reasoners have been implemented, such as 
FUZZYDL [48], DELOREAN [42] and FIRE [153, 385], SOFTFACTS [421] and 
DLMEDIA [435, 420]. 

Not surprisingly, each reasoner uses its own fuzzy DL language for rep- 
resenting fuzzy ontologies and, thus, there is a need for a standard way to 
represent such information. 

A first possibility would be to adopt as a standard one of the fuzzy ex- 
tensions of the languages OWL and OWL 2 that have been proposed, such 
as [168, 382, 383]. However, as it is not expected that a fuzzy OWL exten- 
sion will become a W3C proposed standard in the near future, [54, 57, 60] 
identifies the syntactic differences that a fuzzy ontology language has to cope 
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FIGURE 10.1: Annotation property defining concept BuyerPreferences. 


with, and proposes to use OWL 2 itself to represent fuzzy ontologies. More 
precisely, [60] uses OWL 2 annotation properties to encode fuzzy SROZO(D) 
ontologies. The use of annotation properties makes it possible (¿) to use cur- 
rent OWL 2 editors for fuzzy ontology representation, and (ii) that OWL 2 
reasoners discard the fuzzy part of a fuzzy ontology, producing almost the 
same results as if it would not exist. In order to support this methodology for 
fuzzy ontology representation, [60] describes an implementation of a Protégé 
plug-in to edit fuzzy ontologies and some parsers that translate fuzzy ontolo- 
gies represented using this methodology into the languages supported by some 
fuzzy DL reasoners. 

Roughly, firstly, we can build the core part of the ontology by using any 
ontology editor supporting OWL 2, such as Protégé 4.1° [200, 333]. This allows 
reasoning with this part using standard ontology reasoners. Then, we can add 
the fuzzy part of the ontology by using annotation properties. Representing 
the fuzzy information using OWL 2 annotations can also be done with an 
OWL 2 ontology editor (see, e.g., Figures 10.1 and 10.2). 


Shttp://protege.stanford.edu/ 
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FIGURE 10.2: Annotation property defining fuzzy datatype 
ls(22000, 24000). 


However, typing the annotations is a tedious and error-prone task, so a 
Protégé plug-in that makes the syntax of the annotations transparent to the 
users has been developed. The Fuzzy OWL 2 plug-in is publicly available on 
the Web [165]. Once installed, a new tab Fuzzy OWL enables the plug-in. 
The plug-in has a menu with the available options (see Figure 10.3). The user 
can choose to define fuzzy elements in the ontology (fuzzy datatypes, fuzzy 
modified concepts, weighted concepts, weighted sum concepts, fuzzy nomi- 
nals, fuzzy modifiers, fuzzy modified roles, fuzzy axioms, and fuzzy modified 
datatypes), and he/she can specify the fuzzy logic used in the ontology. 

Figure 10.4 illustrates how the plug-in works by showing how to create 
a new fuzzy datatype. The user specifies the name of the datatype, and the 
type of the membership function. Then, the plug-in asks for the necessary 
parameters according to the type. A picture is displayed to help the user 
recall the meaning of the parameters. Then, after some basic error checking, 
the new datatype is created and can be used in the ontology. 

Furthermore, the plug-in is integrated with the fuzzyDL [164] reasoner [41] 
and makes it possible to submit queries to it. For the moment, such queries 
must be expressed using the particular syntax supported by fuzzyDL. 

We are not going into more detail here and refer the reader to [60] and 
the FuzzyOWL2 web site [165], from which one may download some fuzzy 
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FIGURE 10.3: Menu options of the Fuzzy OWL 2 plug-in. 


OWL 2 ontologies as well, such as a fuzzy wine ontology, an ontology for 
matchmaking, and multi-criteria decision making. 


10.6 Reasoning 


In Section 4.4 we have described the various reasoning problems of in- 
terest for crisp DLs such as the consistency problem, the subsumption prob- 
lem, the instance checking problem, and the instance retrieval problem, which 
have their natural fuzzy analogue. Other major reasoning problems are the 
best entailment degree- and the best satisfiability degree problems (see Equa- 
tion (10.1) and Equation (10.2) in Section 10.2.1). 

In summary, in fuzzy DLs the following problems are of interest. 


Consistency problem: 


e Is K satisfiable? 
e Is C coherent, i.e., is C7 (x) > 0 for some model Z of K and x € A7? 


Instance checking problem: 


e Does K E (a:C,n) hold? 
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FIGURE 10.4: Creation of a fuzzy datatype with the Fuzzy OWL 2 plug-in. 


Subsumption problem: 


e Does K H (C E D,n) hold? 


Best entailment degree problem: 
e What is bed(K, q)? 
Best satisfiability degree problem: 
e What is bsd(K, p)? 
Instance retrieval problem: 
e Compute the set {(a,n) | n = bed(K, a:C)} 
Note that, as for the crisp case in order to determine whether 
K E ((a,b):R,n) , 


we may reduce it to the instance problem, as 


KE ((a,b):R,n) iff KU {(b:B,1)} H (a: 3R.B,n) , 


where B is a new concept. 
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Similarly as for the crisp case, all the above problems can be reduced to 
satisfiability degree problems as long as the below presented reductions are 
supported by the underlying DL language (if not then specific algorithms have 
been developed): indeed, we have 


Remark 32 (Fuzzy DL problem reductions). The following problem reduc- 
tions hold: 


e K is satisfiable iff bsd(K,a:1) > 0, where a is a new individual. 
e C is coherent w.r.t. K iff one of the following holds: 


— KU {(a:C > 0)} is satisfiable, where a is a new individual; 
-K (C El ps 
— bsd(K,a:C) > 0, where a is a new individual. 


e KE (a:C,n) iff one of the following holds: 


— KU {(a:C < n)} is not satisfiable; 
— bed(K,a:C) > n. 


e K= (CE D,n) iff one of the following holds: 


— KUf{(a:C > D < n)} is not satisfiable, where a is a new individual; 
— bed(K,C E D) >n. 


e Similarly to the propositional fuzzy logic case (see Equation (8.53) 
and Equation (8.54)), we have that 


bed(K,ġ) = minz. such that KU {(¢ġ < x)} satisfiable (10.5) 
bsd(K, ¢) max. such that KU {(¢ > x)) satisfiable . (10.6) 


| 


10.6.1 The Case of the AL Family 


We describe here the major reasoning frameworks within the fuzzy AL 
family. As for the crisp case (see Section 4.4.1), we describe here decision 
algorithms for the knowledge base satisfiability problem for the fuzzy DL 
ALC, which is sufficiently expressive to illustrate the main characteristic of 
the methods, which follows to some extend those presented in Section 8.2.2 
for the fuzzy propositional case. 

But before we start, let us discuss some very important consequences 
of Remark 28. The literature contains several reasoning algorithms for 
Lukasiewicz [49, 425, 426], Product [47], or any left continuous t-norm fuzzy 
DLs [53, 387]) that claim to support GCIs. These algorithms restrict them- 
selves to witnessed models. Unfortunately, these papers are implicitly assum- 
ing that the logic satisfies FMP. However, by Remark 28 we have shown that 
this assumption cannot be done. 
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The fuzzy KB K' is an example showing that some algorithms for 
Lukasiewicz logic [49, 425, 426] do not work properly. In fact, these algorithms 
say that X’ is satisfiable, whereas it is not. More precisely, the blocking condi- 
tion (see later on) of the algorithms fails due to an implicit assumption of the 
FMP: it makes the algorithms stop after having created individual b2, whereas 
the inconsistency appears at individual b3. 

Similarly, K’ is also a counterexample to the algorithm for product 
logic [47] and it is not difficult to obtain counterexamples for the algorithms 
that consider other t-norms [53, 387] by doing small modifications in K’; it is 
only necessary to change the degree of truth used in axiom (5). 

Consequently, the proofs of the correctness of the cited algorithms are only 
valid if we add some additional restrictions: 


e (49, 425, 426] consider the fuzzy DL ALC under Eukasiewcz fuzzy logic. 
The proof is only valid in case we only consider acyclic KBs. 


e [47] consider the fuzzy DL ALC under product fuzzy logic. Here, the 
proof is only valid if we only consider unfoldable KBs. 


e [53] provides reasoning algorithms for the fuzzy DLs ALC defined by 
families of fuzzy operators corresponding to a left-continuous t-norm ex- 
tended with an involutive negation®. This work restricts itself to acyclic 
KBs, so for instance K cannot be represented in the logic. Here, the proof 
is correct for Lukasiewicz logic, while the proof of the correctness of the 
algorithm for acyclic KBs in a fuzzy logic different from Lukasiewicz is 
not valid, while the proofs are correct for unfoldable KBs. 


e [387] provides a reasoning algorithm for the fuzzy DLs ST defined by 
families of fuzzy operators corresponding to a left-continuous t-norm. 
Here, the semantics of GCIs is defined using Zadeh’s set inclusion, but 
the proof of the correctness of the algorithm is only valid if we restrict 
to unfoldable KBs, or to acyclic KBs and Lukasiewicz fuzzy logic. 


In the case of finite Lukasiewicz fuzzy logic (that is, if a finite set of degrees 
of truth is assumed), the algorithms [49, 425, 426] would not work properly 
either. As we have shown, the models of K require an infinite number of 
degrees of truth, so K is unsatisfiable under a finite number of degrees of 
truth. However, these algorithms would say that K is satisfiable. 


Remark 33. Finally, we would like to add a remark that should be taken into 
account in the algorithms for reasoning in fuzzy DLs that include GCIs and 
the weighted sum constructor [48]. It is not hard to see that, even in SFL, one 


SHere the fuzzy logic corresponding to a left continuous t-norm & is understood as based 
on the connectives given by the t-norm ®, its residuum, Lukasiewicz negation, and the dual 
t-conorm of ®. This is quite different than the logic of left-continuous t-norms as used 
in [186, 188, 189], and closer (except for the use of truth constants as concept constructors) 
to the framework in [169]. 
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can simulate fuzzy KBs that do not satisfy the FMP if we take K and replace 
axiom (4) in Example 27 with 


A = (0.5-VR.A+0.5-VR.A) . 


Hence, in this case, we also need to restrict to unfoldable TBoxes to guaran- 
tee the FMP. In fact, the KB satisfiability problem in ALC under SFL with 
weighted sum constructor is indeed undecidable (see below). 


The issues raised by [39] generated a series of undecidability results [26, 27, 
28, 69, 74, 92] caused essentially by cyclic TBoxes (see [74] for a more detailed 
account) and non Gédelt-norms: 


Proposition 69 ([74]). Assume that fuzzy GCIs are restricted to be classi- 
cal, i.e., of the form (a,1) only. Then for the following fuzzy DLs, the KB 
satisfiability problem is undecidable: 


1. ELC’ with classical axioms only under Lukasiewicz logic and product 
logic; 


2. ELC under any non Godelt-norm 8; 


3. ELC with concept assertions of the form (a = n) only under any non 
Godelt-norm 8; 


4. AL with concept implication operator >9 and concept assertions of the 
form (a = n) only under any non Gédelt-norm Y. 


5. ELC under SFL with weighted sum constructor. 


On the other hand it is remarkable to note that from the results in [46] it 
follows immediately that 


Proposition 70 ([46]). The KB (witnessed) satisfiability problem is decidable 
for the fuzzy DL SROTQ under Gödel logic, under SFL and any finitely valued 
truth space. 


As we will see later on, the result of Proposition 70 is obtained from the fact 
that under Gödel logic, under SFL and any finitely valued logic, the satisfi- 
ability problem of a fuzzy SROTQ KB can be reduced to a crisp SROTQ 
satisfiability problem, which is known to be decidable (see Section 4.2 and 
[203]). 


TEL with complement. 
8See Section 10.2.2. 
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10.6.1.1 Reduction to Classical Logic 


To start with, let us show how one may reduce the KB satisfiability prob- 
lem in fuzzy DLs into crisp KB satisfiability problems under Gódel logic, under 
SFL and any finitely valued logic. Besides obtaining decidability results, the 
method has also the advantage that classical DL reasoners can be used to 
reason within these fuzzy DLs. For instance, this is the method followed by 
the fuzzy DL reasoner DELOREAN [42]. 

To illustrate the method, we recap [396], which first proposed the method, 
which then has been extended and improved in further works such as [41, 43, 
44, 45, 46, 50, 56, 61, 383, 405]. Specifically, 


e [396] proposed a reasoning preserving procedure for fuzzy ALCH into 
crisp ALCH, under SFL; 


e [105] extends [396] to the case of fuzzy ALC where the truth space is 
any finitely valued complete lattice; 


e [44] extends [396] to the case of fuzzy SROTQ(D); 
e [45] extends [396] to the case of fuzzy SROTQ(D) under Gödel logic; 


e [61] extends [396] to the case of fuzzy SROZO under finitely many- 
valued Lukasiewicz logic; 


e [46] extends [44] to the case of fuzzy SROZO(D) under Gödel logic and 
under SFL. The results extends immediately to any finitely valued fuzzy 
SROTQ(D) as well. 


Before we formally present the method, we first illustrate the basic idea it relies 
on, which is based on the ideas presented in [36, 37] for so-called regular multi- 
valued logics and somewhat resembles the method proposed in Section 8.2.2.4 
for fuzzy propositional logic. 

Assume we have a K = (Ø, A), where A contains 


(a: A, 0.6) 
(a: A,0.7) . 


Clearly, K is not satisfiable. Now, similarly to Equation (8.61), let N be the 
set of truth degrees that occur in K, their negation plus {0, 0.5, 1}, ie., 


NK = {0,0.3, 0.4, 0.5, 0.6, 0.7, 1) 


and define 
NE = NE \ {0}. 


The important point here is that K has a model iff K has a model based on 
the truth space N K and, thus, allows us to restrict our attention to the values 
in NK only. 
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Now, for each a € NK we introduce new concepts A>a- Axa represents 
the crisp set of individuals which is an instance of A with degree higher or 
equal than a, ¿.e., the a-cut of A. Of course, we have to consider also the 
relationships among the introduced concepts. so, e.g., we need the axioms 


A>0.5 


A>0.6 


For instance, axiom A>0.5 E A>0.4 dictates that if a truth value is < 0.5 then 
it is also > 0.4. Now, we will map 


(a:A,0.6) ++ a:Aso.6 
(a:7A, 0.7) > a:nAso.4 « 


The argument behind the last map is that (a:4,0.7) dictates that 
(a:A < 0.3), which restricted to the truth space NK is the same as saying 
that a cannot be an instance of A to degree > 0.4. This illustrates our basic 
idea. 

Formally, consider fuzzy ACCH under SFL. We will assume that fuzzy 
KBs do not include fuzzy axioms of the form (a, 0), (a:4C,1). The reason is 
that they are always satisfied. Let K = (T, A, R} be fuzzy ALCH KB under 
SFL. Consider 


XK = {0,0.5,1}U {e| (a,c) occurs in K} (10.7) 


from which we define 


NE = XFuf{i—c|eex*} (10.8) 
NS = NE {Obs (10.9) 


Essentially, NK (which is finite) contains the truth degrees that occur in X 
and their negation. The important point here is that K has a model iff K 
has a model based on the truth space NK and, thus, allows us to restrict our 
attention to the values in N* only. This property derives directly from [395]: 


Proposition 71. A fuzzy ACCH KB K under Gödel logic and under SFL 
has a model iff it has a (witnessed) model over the truth space NK. 


In the following, we may assume now that the truth space M is a finite chain 
of p + 1 elements: namely 


N={0=%<U<---<y=]}, 
where p > 2, from which we define 


Ny =N\ {70} - 
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For instance, in the case of fuzzy ALCH KB K under SFL, 
N = NE. (10.10) 


Degrees of truth will be denoted as y€ M and a EN}. We will also define 
EN = Vi i = Vi-1- 

We will also use pa € {>,>,<,<}, > € {>,>}, < € {<,<}. The 
symmetric pd, and the negation — ™ of an operator ™ are defined as follows: 


Now, let A be the set of atomic fuzzy concepts and R the set of atomic fuzzy 
roles in a fuzzy KB K = (T, A,R), respectively. For each œ € N}, for each 
A € A, a new atomic concept A>. is introduced. Similarly, for each Ra € R, 
a new atomic role RA>a is created. 
For each 1 < i < p—1 and for each A € A, T(N) is the smallest TBox 
containing these axioms: 
Asis AS 3 (10.11) 


Similarly, for each Ra € R, R(N) is the smallest RBox containing: 


Rai = Rad»; i (10.12) 


Fuzzy concept and role expressions are reduced by using mapping p, as shown 
in Table 10.4. Given a fuzzy concept C, p(C, > a) is a crisp set containing all 
the elements which belong to C with a degree greater than or equal to a. The 
other cases p(C, xy) are similar. p is defined in a similar way for fuzzy roles 
and this equivalence also holds. 

Axioms are reduced as in Table 10.5, where «(7) maps a fuzzy axiom 7 
into a set of crisp axioms. We note «(A) the union of the reductions of all 
the fuzzy axioms in A. Analogously, «(7) is the union of the reductions of all 
fuzzy concepts in 7, where as k(R) is the union of the reductions of all fuzzy 
roles in R. 

Let crisp(K) denote the reduction of a fuzzy ontology K. That is, a fuzzy 
KB K = (T, A, R) is reduced into a KB 


crisp(K) = (T(N) UK(T), K(A), RW) U K(R)) . 
Now, it can be shown that 


Proposition 72 ([46, 396]). A ALCH fuzzy KB K under SFL is satisfiable 
iff its crisp representation crisp(K) is satisfiable. 


A consequence of Proposition 72 is that 
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TABLE 10.4: Crisp mapping of concept and role expressions 


p(T, by) = 
A(T, ay) =L 
pL, oy) =L 
p(l, a) = 
P(A, > 7) =A>, 
P(A, > y) =Ad4y 
p(A, < Y) =A>+y 
p(A, < y) =>A>, 
p(=C,0a y) =p(C, x Ozy) 
p(C ON D, by) =p(C, >) p(D, >y) 
p(C OD, <7) =p(C, <7) U p(D, <7) 
p(C UD, by) =p(C, >y) U p(D, >77) 
p(C UD, ay) =p(C, <7) N p(D, <7) 
P(VR.C,> y) =Vp(R, > Oy).p(C, > 7) 
P(VR.C, > y) =Vp(R, > Oy).p(C, > 7) 
P(VR.C, ay) =3p(R, <7 O y).p(C, <7) 
P(AR.C, by) =3p(R, >y).p(C, >) 
p(AR.C, <7) =Vp(R, =< 1).p(C, <7) 
P Ra 2 Y) = Raza 
p(Ra, > 7) = Razi 


201 


Proposition 73 ([46, 396]). Given a ALCH fuzzy KB K. Then under Gódel 
logic or SFL, bed(K,a) € NK. Moreover, given a fuzzy KB K over a finitely 


valued truth space N, then bed(K, a) e N. 


Therefore, by a binary search on NÝ (reps. M), the value of bed(K, a) can be 
determined in at most log|N*| entailment tests and, thus, crisp entailment 
tests. Therefore, the BED problem can be reduced to the crisp case as well. 
The reductions for other logics such as Gödel logic or Lukasiewicz logic are 
based on a similar principle [46, 61]). For instance, under Gédel logic, we have, 


TABLE 10.5: Crisp reduction of the fuzzy DL axioms 


K({a:C,y)) = {a:p(C, > y) 

K(((a, b):R,y)) = {(a, b):o(R, > 7) } 

K((C E D,n)) ={p(C, > 97) E p(D,> y) 
K(CED) =Uaey, to(C,> a) E p(D,> a)) 
K(A=C) =Unew, {0(A, 2 a) = p(C, > a)) 

RURE S, y)) ={o(R, > Oy) E p(R,> 7)} 
(RES) =Uuen, to(R,> a) E p(S, > a)) 
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e.g., 


K((C E D,y)) =Uaes; | a< {P(C 2 a) E p(D, 2 a)} , 


while over finitely valued Lukasiewicz logic, we have, e.g., 


«(CED y) = UtoC,> 1) E p(D,> m) 
1572 
for every pair 71,72 € NT such that y =y2+1-y. 


The mapping p also changes: e.g., under Gödel logic, we have, e.g., 


pWRC,>7) = [| (Wol(R,>a).0(C,2 0), 
aEN4 | ay 


while over finitely valued Lukasiewicz logic, we have, e.g., 


71,72 
for every pair 71,72 E N* such that yı =72+1-7. 


However, despite being the method simple to implement (see the DE- 
LOREAN [42]) and the size of the crisp KB linearly bounded by the size of 
ITUNE] (resp., [T|IN |), from a practical point of view the method does not 
scale yet, especially for the BED problem. For other logics, such as Gödel 
logicor Lukasiewicz logic, the reduction may become even exponential. 

It is not hard to be convinced that the crisp reduction method can easily 
be extended to SROZQ(D) under any other finitely valued DL as well, as 
explained at the beginning of this section and, thus, proofs Proposition 70. 


10.6.1.2 Analytical Fuzzy Tableau 


We recall here the typical fuzzy tableau method employed for fuzzy DLs 
of the fuzzy AL family under SFL semantics, introduced in [395] for acyclic 
ALC and later on extend to fuzzy ALC with classical GCIs [390] and then 
extended to more expressive DLs as in, e.g., [73, 384, 387, 389, 397, 405]. 

Specifically, 


e [395] provides a tableau for acyclic fuzzy ALC KBs. 
e [390] extents [395] to deal with ALC KBs with classical GCIs. 
e [105] deals with acyclic ALC over finite complete lattices. 


e [73] extends [405] to SHT with graded GCIs over finite complete resid- 
uated De Morgan lattices. 


e [389] extends [395] to acyclic fuzzy SHIN KBs. 
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The method starts from the tableau algorithm for crisp DLs illustrated in 
Section 4.4.1 and applies the ideas developed for the analytical tableau for 
fuzzy propositional logic as from Section 8.2.2.3. So, let K = (7,A) be an 
ALC knowledge base under SFL. 

At first, we transform any concept in K into negation normal form (NNF), 
i.e., negation occurs only in front of atomic concepts. To do so, as for the crisp 
case we push all negation signs as far as possible into the description, using de 
Morgan’s rules and the usual rules for quantifiers: that is, we iteratively apply 
the transformation rules of Equation (4.3) for the crisp case, which indeed are 
semantics preserving under SFL (see Table 10.2): 


aT > C 
(CAD) >» CUAD 
A(CUD) = CHAD 

RC + VRAC 


WRC > IRC. 


With nnf(C) we denote the negation normal form of C, obtained by applying 
the rules above. 

Next, as for the fuzzy propositional case (see Equation (8.62)), we define e 
as half of the minimal absolute difference among the values in NKP. That is, 


e = min{d/2 | nım € NEP d = jn- ml} . (10.13) 
Now we have that the analogue of Proposition 55 


Proposition 74 ([395]). Let K be a fuzzy KB in SFL. Then for n > 0 and 
0<n, <05< nr <1 


KE (a:C,n) iff KU{(a:nC,1—n-+)} is not satisfiable 
KE(CED,n) iff KU{(a:CNAD),1—n+6)} is not satisfiable 
KECCD iff KU{(a:C,n)} E (a:D,n) form € [n1,n2) 


KE A=C iff KE ALC andK ECLA. 


Proposition 74, like for the fuzzy propositional case and contrary to the usual 
analytical fuzzy tableau methods such as [384, 387, 389, 390, 395, 397, 405], 
allows us now to use an analytical tableau procedure to decide the KB sat- 
isfiability problem and the entailment problem without requiring the use of 
fuzzy statements of the form (a e n) with e € {<, <, >}, which simplifies the 
calculus. 

Now, we try to construct a finite fuzzy model Z of K via a tableau al- 
gorithm. Essentially, the tableau algorithm is a terminating algorithm that, 
starting from an ABox, tries to build a clash-free forest of trees (called 
completion-forest). If it succeeds then K is satisfiable and from the forest 
a model can be built. Otherwise, K is not satisfiable. 
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The Case with Empty TBox. As for the crisp case (see Section 4.4.1), we 
start with the case of an empty TBox. So, Let K be a KB in NNF with empty 
TBox. A completion-forest F for K is a collection of trees whose distinguished 
roots are arbitrarily connected by edges. Each node v is labelled with a set 
L(v) of expressions (C,n), where n € (0,1] and C € sub(K). The intuition 
here is that v is an instance of C to degree greater or equal than n. We will 
always assume that there cannot be (C,n) € £(v) and (C,m) € L(v) with 
m > n. If there are, the expression with the lower degree is dropped from 
L(v). 

Each edge (v, w) is labelled with a set £((v, w)) of expressions (R, n}, where 
R occurs in K, indicating that (v,w) and instance of R to degree greater 
than or equal to n. As for £(v), we will always assume that there cannot 
be (R,n) € L((v, w)) and (R,m) € L((v,w)) with m > n. If there are, the 
expression with the lower degree is dropped from £((v, w)). 

If nodes v and w are connected by an edge (v, w) with (R,n) € L((v, w)) 
then w is called an R,-successor of v and v is called an R,,-predecessor of w. 
Ancestor is the transitive closure of predecessor, where we omit the degrees. 

For a node v, £(v) is said to contain a clash iff £(v) contains either 


e an expression (1,n) with n > 0; or 
e a pair of expressions (A,n) and (~A, m) with n > 1—m. 


A completion-forest is called clash-free iff none of its nodes contain a clash; it 
is called complete iff none of the expansion rules in Table 10.6 is applicable. 
Now, the algorithm initializes a forest F as follows: 


e F contains a root node vj, for each individual a; occurring in A; 


e F contains an edge (vj, vj), for each ((a;,a;):R,n) € A; 
e for each (a;:0,n) € A, we add (C,n) to L(vj); 


e for each ((a;,a;):R,n) € A, we add (R,n) to L((v), vgy). 


Then the completion-forest F is then expanded by repeatedly applying the 
completion rules described in Table 10.6 and answers “X is satisfiable” iff the 
completion rules can be applied in such a way that they yield a complete and 
clash-free completion-forest. Note that it is relatively easy to build a model T 
from a complete and clash-free completion-forest. Informally, 


e the domain of Z are the nodes of the forest; 
e the interpretation of individual a; is vĝ; 

e if (R,n) € L((v,w)), then R7(v,w) = n; 

e if (A,n) € L(v), then 4%(v) =n. 
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TABLE 10.6: The tableau rules for fuzzy ALC with empty TBox. 


(N). If (i) (C,MCa3,n) € L(v) and (ii) {(C1,n), (Co,n)} Z Lv), then add 

(Ci n) and (C2,n) to L(v). 

(1). If (i) (Cy U C2,n) € L(v) and (îi) {(C1, n), (C2,n)}} N L(v) =0, then add 
some (C,n) € ((C1,n),(Ca,n)y to L(v). 

(Y). If (i) WR.C,n) € L(v), (îi) (R,m) € L((v,w)), m > 1— n, and (iii) 
(C,n) Z L(w), then add (C,n) to L(w). 

(3). If ©) (AR.C,n) € L(v) and (ii) there is no (R,ni) € L((v,w)) with 

(C,na) € L(w) such that min(n1,n2) > n, then create a new node w, 

add (R,n) to £((v, w)) and add (C,n) to L(w). 


Now, termination, soundness, and completeness of the algorithm have been 
shown. 


Proposition 75 ([395]). For each knowledge base K = (0, A), 
1. the tableau algorithm terminates; 


2. if the expansion rules can be applied in such a way that they yield a 
complete and clash-free completion-forest, then K has a finite model; 


3. if K has a model, then the expansion rules can be applied in such a way 
that they yield complete and clash-free completion-forest for K; 


4. the KB satisfiability problem is PSpace-complete [395]. 
Example 35. Consider a fuzzy analogue of Example 9: 
K = {(aVR.ANAR.B, 0.9), ((a, b):R, 0.8), ((a,c):P,0.7)} . 


K is satisfiable as there is a clash-free completion-forest (see Figure 10.5). 
The model build from the forest is: 


e A? = {a,b,c £}; 

e the interpretation of individuals is the identity function; 
e R7(a,b) = 0.8, R? (a,x) = 0.9, P7 (a,c) = 0.7; 

e AT(b) = A“ (x) = Bł (x) = 0.9. 


The Case of Acyclic TBox. We next extend the previous algorithm to the 
case of acyclic TBoxes. 

Similarly to Section 4.4.1.2, we also present here a method, called lazy 
unfolding. To do so, we extend our calculus in Table 10.6 with the rules in 
Table 10.7. 

As for Proposition 75, it can be shown that 
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((WR.AM3R.B,09),(R.4,09),(3R.B,09) „SP > 
<R,0.8> <R,0.9> 
{<A,0.9>} P x (<A,0.9>,<B,0.9>) 


FIGURE 10.5: Clash-free complete completion-forest from fuzzy ABox. 


Proposition 76. For each knowledge base K = (T, A) with acyclic T, 
1. the tableau algorithm terminates; 


2. if the expansion rules can be applied in such a way that they yield a 
complete and clash-free completion-forest, then K has a finite model; 


3. if K has a model, then the expansion rules can be applied in such a way 
that they yield a complete and clash-free completion-forest for K. 


Example 36. Consider Example 35, where we add 
B Eos C 


to K. K is satisfiable as there is a clash-free complete completion-forest (see 
Figure 10.6). 


TABLE 10.7: The tableau rules for fuzzy ALC with acyclic fuzzy KB. 


(24) If AC, CET, (A,m) € L(v) with m > 1—n and (C, n) ¢ L(v) then 
add (C, n) to L(v). 


(E*) If AEC €T, (A,n) € L(v) and (C, n) ¢ L(v) then add (C, n) to £(v). 


(=4) f A=, CET, (A, 
add (C, n) to L(v). 
( 


m) € L(v) with m > 1 — n and (C, n) ¢ L(v) then 


(=4) If A=, C € T, (54,m) € L(v) with m > 1—n and (nnf(=C),n) g 
L(v) then add (nnf(=C),n) to L(v). 


(=4) If A=C €T, (A,n) € L(v) and (C, n) ¢ L(v) then add (C, n) to L(v). 


(=A) If AZC € T, (A,n) € L(v) and (nnf(aC),n) ¢ L(v) then add 
(nnf(AC),n) to L(v). 
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((WR.AM3R.B,0.9), (YR.A,0.9), (3R.B,0.9}} O 
<R,0.8> <R,0.9> 
{<A,0.9>} © X (<A,0.9>,<B,0.9>, <C,0.4>} 


FIGURE 10.6: Clash-free and complete completion-forest from acyclic fuzzy 
KB. 


The model built from the forest is: 

e A? = {a,b,c x}; 

e the interpretation of individuals is the identity function; 
e R7(a,b) = 0.8, R? (a,x) = 0.9, P7 (a,c) = 0.7; 

e AT(b) = AF (x) = Bł (x) = 0.9,C7 (x) = 0.4. 


The Case with General TBox. Eventually, we show here how we may deal 
with the case in which the GCIs in the TBox are of the general form (C E D,n) 
and CD?. As we have seen in Remark 25, w.l.o.g. we may assume that the 
former GCIs have the form (T E C’,n). 

Like for the crisp case, to cope with the non-termination problem, a notion 
of blocking has to be introduced, which is the same as for the crisp case: we 
say that a node v is directly blocked iff none of its ancestors is blocked and 
there exists an ancestor w such that 


L(v) = L(w) . 


In this case we say that w directly blocks v. A node v is indirectly blocked iff 
one of its ancestors is blocked. Finally a node v is blocked iff it is not a root 
node and it is either directly or indirectly blocked. 

Now, the calculus is as described above except that the rules are not applied 
to blocked nodes. Specifically, the rules are described in Table 10.8. As before, 
it can be shown that using the rules in Table 10.8 


Proposition 77. For each knowledge base K = (T, A) 
1. the tableau algorithm terminates; 


2. if the expansion rules can be applied in such a way that they yield a 
complete and clash-free completion-forest, then K has a finite model; 


9 Axioms of the form A=C are replaced with ALC and CLA. 
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TABLE 10.8: The tableau rules for fuzzy ALC with GCIs. 


— 


). If (i) (C1 Co, n) € Lv), (îi) [(C1,n), (Ca, n)} Z L(v), and (iii) node 
v is not indirectly blocked, then add (C,,n) and (C2,n) to L(v). 

). If (i) (C1 U Co, n) € L(v), (ii) {(C1, n), (C2, n)}NL(v) = O, and (iii) node 
v is not indirectly blocked, then add some (C,n) € {(C1, n}, (C2,n)} to 
L(v). 

(V). If (i) (VR.C,n) € Liv), (îi) (R,m) € L((v,w)) with m > 1—nm, (iii) 

(C,n) Z L(w), and (iv) node v is not indirectly blocked, then add (C, n) 
to L(w). 

3). If (i) (3R.C,n) € L(v), (ii) there is no (R,n1) € L((v, w)) with (C,na) € 
L(w) such that min(n;,n2) > n, and (iii) node v is not blocked, then 
create a new node w, add (R,n) to L((v, w)) and add (C,n) to L(w). 

(C). H(i) (TED NET, (ii) (D,n) g L(v), and (iii) node vis not indirectly 

blocked, then add (D,n) to £(v). 

(Č). If (i) CED €T, (îi) for some n € NE, ((nnf(50),1—n+e),(D,n)n 

L(v) = 9, and (iii) node v is not indirectly blocked, then add E to L(v) 

for some E € {(nnf(7C),1—n+ 6), (D,n)}. 


— 


| 
— 


3. if K has a model, then the expansion rules can be applied in such a way 
that they yield a complete and clash-free completion-forest for K. 


Example 37. Consider a fuzzy variant of Example 12. 
K = {(a:A, 0.7), (AE 3R.A,0.6)) . 


K is satisfiable as there is a clash-free and complete completion-forest (see 
Figure 10.7). Note that node y is blocked by x. 
The model build from the forest is: 


e A’ = {a,x}; 

e the interpretation of individuals is the identity function; 
e R* (a,x) = R? (x, x) = 0.6; 

e A*(a) = 0.7, A? (x) = 0.6. 


As for the crisp case, it is not our purpose to present here the tableau rules 
for more expressive languages of the fuzzy AL family. 

We refer the reader to, e.g., [389]. However, let us note that besides adding 
a rule for each construct, the blocking condition is extended as well to a more 
sophisticated definition, as, e.g., SHZF does not employ the finite model 
property anymore. 

Nevertheless, in Appendix H we recall the tableau calculus for fuzzy 
SHIF, under SFL. 
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a {(A,0.7), (HAUR.A,0.6)) 


((4,06), AU3R.A,08)) 
{(4,0.6), (AU 3R.4,04)) 


FIGURE 10.7: Clash-free and complete completion-forest from fuzzy 
ALC KB. 


10.6.1.3 Fuzzy Tableau for Finite-Valued DLs 


We have seen in Section 8.2.2.3, Algorithm 5, that in fact an analytical 
fuzzy tableau for propositional fuzzy logic can be adapted to the case in which 
the truth combination functions are defined via arbitrary matrix over a finite- 
truth space such as Ln = {0, =,- - -, 2=%, 1} (see Section 8.1.7). 

Next, we show that the tableau calculus illustrated in the previous Sec- 
tion 10.6.1.2 can be adapted to this case in an analogous way [73]. 

The tableau algorithm is quite similar than the one described for fuzzy 
ALC with GCIs, except that now all inference rules and the completion-forest 
initialization phase become non-deterministic. 

ia consider a fuzzy ALC KB with GCIs, truth-space Ln = 
10, —, pa nao , 1) and truth-combination functions &, 9, = and © defined 
over Ln. 

The notions of completion-forest, R,-successor, R,-predecessor, and an- 
cestor is as for Section 10.6.1.2. However, we will interpret (C,n) € L(v) as 
a statement of the form:“v is an instance of C to degree exactly equal to n”. 
The case for role expressions (R, n} € L((v, w)) is similar. The notion of clash 
is adjusted as for the fuzzy propositional case and is as follows. For nodes v 
and w, 


e L(v) is said to contain a clash iff L(v) contains either 


— an expression (1,n) with n > 0; or 


— a pair of expressions (A,n) and (4,m) with n 4 m; 
e L£((v,w)) is said to contain a clash iff £(v) contains 


— a pair of expressions (R,n) and (R, m) with n 4 m. 
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Now, the algorithm initializes a forest F non-deterministically as follows: 


e F contains a root node v¿, for each individual a; occurring in A; 


e F contains an edge (vå, v), for each ((a;,a;):R,n) € A; 


e for each (a;:C,m) € A, we add (C, m’) to L(vf,) for some m € Ln with 
m > m; 


for each ((a;,a;):R,m) € A, we add (R,m’) to L((uj,vj)) for some 
m € Ly, with m > m. 


Then the completion-forest F is then expanded by repeatedly applying the 
completion rules described in Table 10.9 and answers “K is satisfiable” iff 
the completion rules can be applied in such a way that they yield a com- 
plete and clash-free completion-forest. It can be shown that using the rules in 
Table 10.9 


Proposition 78 ([73]). For each knowledge base K = (T, A) with a finitely 
valued truth space Ln 


1. the tableau algorithm terminates; 


2. if the expansion rules can be applied in such a way that they yield a 
complete and clash-free completion-forest, then K has a finite model; 


3. if K has a model, then the expansion rules can be applied in such a way 
that they yield a complete and clash-free completion-forest for K. 


Of course, the method can be adapted to any truth space as well, as long as 
it is finite. 
In Appendix H.2 we extend it to fuzzy SHTF,. 


10.6.1.4 Operational Research-based Fuzzy Tableau 


Similarly as we have seen in Section 8.2.2.2 and Appendix F for the fuzzy 
propositional case, an important and application oriented approach for rea- 
soning with fuzzy DLs is based on Operational Research (OR) [47, 48, 51, 52, 
53, 55, 58, 59, 62, 398, 399, 419, 418, 425, 426). 

We will illustrate here the method under SFL. We refer to, e.g., [47, 53, 399] 
to other semantics. 

As for the analytical tableau case, we will assume here that all concepts 
are in NNF. 


The Case with Empty TBox. In the following, we will assume that the 
TBox in a knowledge base is empty. Like the tableau algorithm presented for 
crisp DLs such as [206, 209] our algorithm works on completion-forests since 
an ABox might contain several individuals with arbitrary roles connecting 
them. Our method combines appropriate DL tableaux rules, which constructs 
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TABLE 10.9: The tableau rules for finitely valued fuzzy ALC with GCIs. 


(FA). If (i) (C1 N C2,m) € Liv), (ii) there are m,,ma € Ln such that m 8 
ma = m with ((C¡,m1),(C2,ma)+ Z L(v), and (iii) node v is not 
indirectly blocked, then add (C1, mı) and (C2, M2) to L(v). 


(U). If (i) (C1 UC2,m) € Liv), (ii) there are m,,ma € Ln such that m O 
ma = m with {(C1, m1), (C2, M2)} NL(v) = Ø, and (iii) node v is not 
indirectly blocked, then add some (C, k} € {(Cy, m1), (C2, ma) to L(v). 

(>). If (i) (AC,m) € L(v) with (C,Sm) ¢ L(v) and (îi) node v is not indi- 
rectly blocked, then add (C, ©m) to L(v). 

(>). If (i) (Ci > Co,m) € Liv), (ii) there are m,,m2 € Ln such that 
mi => mo = m and {(C1, Mı), (C2, M2)}} Z L(v), and (iii) node v is 
not indirectly blocked, then add (C1, ma) and (C2, ma) to L(v). 


(V). If (i) (WR.C,m) € Lv), (ii) (R,mi) € L((v, w)), (iii) there is ma € Ln 
such that mı > ma > m with (C,ma) Z L(w), and (iv) node v is not 
indirectly blocked, then add (C,ma) to £(w). 


(3). If (i) (AR.C,m) € L(v), (ii) there are m1, ma € Ln such that mı 9 ma = 
m, (iii) there is no (R, mı) € L((v,w)) with (C,ma) € L(w), and (iv) 
node v is not blocked, then create a new node w, add (R, ma) to L((v, w)) 
and add (C, m2) to L(w). 


(3). If (1) (AR.C,m) € Liv), (îi) (R,ma) € L((v, w)), (iii) there is m2 € Ln 
such that ma 9 ma < m with (C,ma) € L(w), and (iv) node v is not 
indirectly blocked, then add (C, ma) to £(w). 


(V). If (i) (VR.C,m) € L(v), (ii) there are m,,ma € Ln such that mi => 
ma = m, (iii) there is no (R, mı} € L((v, w)) with (C,ma) € L(w), and 
(iv) node v is not blocked, then create a new node w, add (R, ma) to 
L((v, w)) and add (C,ma) to L(w). 

(E). If (i) (CE Dm) € T, (îi) there are m,,ma € Ln such that my > ma > 
m, (iii) {(C,m1), (D, m2}} Z L(v), and (iv) node v is not indirectly 
blocked, then add (C, ma) and (D, m2) to L(v). 

(Č). If (i) CED e T, (ii) there are mu, ma € Ly such that mu < ma, (iii) 
{(C,m1),(D,m2)} Z L(v), and (iv) node v is not indirectly blocked, 
then add (C, mı) and (D, mz) to L(v). 


the completion-forest with methods developed have seen in Section 8.2.2.2 and 
Appendix F. 

So, let K be a fuzzy KB with empty TBox. A completion-forest F for K 
is a collection of trees whose distinguished roots are arbitrarily connected by 
edges. Each node v is labelled with a set L(v) of concepts C € sub(K). If 
C € L(v) then we consider a variable x,.c. The intuition here is that v is 
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an instance of C to degree equal or greater than the value of the variable 
Ly:c. Each edge (v, w) is labelled with a set L((v, w)) of roles R € Rx and if 
R € L((v, w)) then we consider a variable «(,,.,):z representing the degree of 
being (v, w) and instance of R. 

If nodes v and w are connected by an edge (v, w) with R € L((v, wY) then 
w is called an R-successor of v. 

The forest has associated a set Cr of constraints c of the form L < V, l =l, 
where l,l’ are linear expressions. Essentially the constraints are added by 
the application of rules to concept expressions to reflect the semantics of the 
connectives. Intuitively, if for a node v, £(v) contains C1 U C2 and CF contains 
Ly:C,uC, > 0.7 (dictating that v is an instance of Ca U C2 to degree at least 
0.7), then the application of the (U) rule to C1 U Co will add the concepts Ci 
and Cə to £(v) and the constraints, encoding ty:c, ® Ly:C, > Lv:C,uC,, to Cr. 
Any assignment to the variables that satisfies these constraints, guarantees 
then also that indeed v is an instance of C1 U Ca to degree at least 0.7. 

We refer the reader to Section 8.2.2.2 and Appendix F for the encoding 
of the truth combination functions. In what follows, we will be using the 
expressions z < 11 Pro, 2 < zi 12 and z > zı & x2 both to denote 
the inequality with respect to a t-norm (t-conorm) operation and the set of 
constraints they are equivalent to. 

Now, given K = ((), A), the algorithm initializes a forest F as follows: 


e F contains a root node wv), for each individual a; occurring in A; 
e F contains an edge (vj, v3), for each assertion axiom ((a;,a;) : R,n) € A; 
e for each (a; : C,n) € A, we add both C to L(vp) and zic > n to Cr; 


e for each ((a;,a;) : Rn) € A, we add both R to L((vj,vj)) and 
Lyi vi):R > n to CF. 


The completion-forest F is then expanded by repeatedly applying the com- 
pletion rules described below. The completion-forest is complete when none 
of the completion rules is applicable. Then, the bMILP (bounded Mixed In- 
teger Linear Programming) problem on the set of constraints Cr is solved. 
We will show that if there is a solution to Cz then K is satisfiable and vice 
versa. 

We also assume a fixed rule application strategy as the order of rules 
below, such that the rule for (4) is applied as last. Also, all expressions in 
node labels are processed according to the order they are introduced into the 
completion-forest F. 

The rules for L-ALC are in Table 10.10. 

Note that, despite that |sub(K)| is O(|K]), we may well have that |Cz| is 
O(2I*1), as the completion-forest F may contain exponentially many nodes. 
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TABLE 10.10: The OR-based tableau rules for fuzzy ALC with empty TBox. 


(var). For variable t,.c occurring in CF add %,:0 € [0,1] to Cf. For variable 
L(y,w):R occurring in Cz add 2(,w):r € [0,1] to Cr. 
. If Le L(v) then Cr := CF U (zu =0). 
. If T € L(v) then CF :=CFU([zy. Tr = 1}. 
. If =A € L(v) then add A to L(v), and CF := CF U {£v:4 < 1— Lua). 
. If (i) C1MC¿ e L(v) and (îi) the rule has not been already applied to this 
concept, then add Cı and C2 to L(v), and CF := CF U {ty:0c, 8 Ly:c, > 
Ly:C04MCo }- 
(U). If (i) CIUC2 E L(v) and (ii) the rule has not been already applied to this 
concept, then add Cı and C2 to L(v), and CF := CF U {ty:c, O Lu:c, > 
Lv: UC» Ye 

(V). If (i) VR.C € L(v), Re L((v, w)) and (îi) the rule has not been already 
applied to this concept, then add C to £(w), and CF := CF U {tw:c > 
Lui Led 

(3). If (i) 3R.C € L(v) and (ii) the rule has not been already applied to this 
concept, then create a new node w, add R to £((v, w)), add C to L(w), 
and Cf := CF U {2w w)R Q Zw:C > Tv:IR.C}- 


Indeed, consider a knowledge base containing only the assertion a 
a:3R.C} N 3R.C4 TI 
YR.(IR.C? NJR.C? N 
VR.(AR.C3 N 3R.C3 M 


YR.(IR.C? NAR.CB)...). 


It is not difficult to see that the completion-forest F is a binary tree of depth 
n consisting of O(2”) nodes and that CF contains both O(2”) variables taking 
values in {0, 1} as well as O(2”) variables taking values in [0, 1]. In determining 
whether a MILP problem has a solution is an NP-complete problem [216] 
we get an NexpTime upper bound for the satisfiability problem (guess the 
assignment to the binary variables and solve the system of linear in-equations 
in exponential time). 
Now, it can be shown that 


Proposition 79 ([53]). For each knowledge base K = (0, A), 


1. the tableau algorithm terminates; 


214 Foundations of Fuzzy Logic and Semantic Web Languages 


2. if the expansion rules can be applied to a knowledge base K = (0, A) 
such that they yield a complete completion-forest F such that CF has a 
solution, then K has a (finite) model; 


3. if a knowledge base K = (0, A) has a (finite) model, then the application 
of the expansion rules yields a complete completion-forest for K such 
that Cz has a solution. 


Example 38. Consider Example 35. A complete completion-forest F is as for 
the crisp case (see Figure 4.2). The constraint set Cr contains the following 
inequations: 


TaNR.AMIR.B > 0-9, 

“(a,b):R 2 0.8, 

Tla, c):P z 0.7, 

Ta NR.A 8 Ta:AR.B 2 TaNR.ANIR.B > 
Tb:A 2 TA NR.A 8 T(a,b):R > 

Tla, x):R 8 Tr:B 2 %aAR.B > 

Ta:A 2 TaNR.A 8 Wa, a):R > 


from which we may build, e.g., the same model as in Example 35. 


The Case of Acyclic TBox. We next extend the previous algorithm to the 
case of acyclic TBoxes. 

Similarly to the analytical fuzzy tableau case, we present here the method 
based on lazy unfolding. We recall that A E, C is the same as (T E A > C,n) 
and A =, C is the same as (T E AO Cin) (note that A > C = AUC and 
AOC = ((A => C)N (C > A))) and, thus, as we have seen in Remark 25, 
w.l.o.g. we may assume that the GCIs in an acyclic TBox have the form 
(T E D,n), where D is of the form A> Cor AGC. 

Now, we extend our calculus in Table 10.10 with the rules in Table 10.11. 
As for Proposition 79, it can be shown that 


Proposition 80 ([53]). For each knowledge base K = (T, A) with acyclic T, 
1. the tableau algorithm terminates; 


2. if the expansion rules can be applied to a knowledge base K = (T, A) 
such that they yield a complete completion-forest F such that CF has a 
solution, then K has a (finite) model; 


3. if a knowledge base K = (T, A) has a (finite) model, then the application 
of the expansion rules yields a complete completion-forest for K such that 
Cr has a solution. 
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TABLE 10.11: The OR-based tableau rules for fuzzy ALC with acyclic fuzzy 
KBs. 


(Ca). Assume (T E D,n) € T, D is of the form A> Cor AGC, visa 
node to which this rule has not yet been applied and A € L(v), then 
add D to L(v) and set Cr := CF U {typ > n}. 


(ÉS) If ACC € T, vis a node to which this rule has not yet been applied 
and A € £(v), then add C to L(v) and set Cr := CF U {£v:A < Ly:c}. 


(=$) If A=C € T, v is a node to which this rule has not yet been applied 
and A € L(v) then add C to £(v) and set CF := CF U {£v:A = duo). 


=4) If AC € T, vis a node to which this rule has not yet been applied 
and A € L(v) then add nnf(=C) to L(v) and set CF := Cr Ultu:maA = 
Luar. 


Example 39. Note that by referring to Example 36, a complete completion- 
forest is as for Example 4.3 in which =B is added to the node label of x. The 
set of constraints can be worked out easily from Example 36, by adding 


TrBUuC 204, 
TB P Tr:C > Tz—BUuC > 
Ta: BS1=TgB > 


from which we may build, e.g., the same model as in Example 36. 


The Case with General TBox. Eventually, we show here how we may deal 
with the case in which the GCIs in the TBox are of the general form (C E D,n) 
and CD". As we have seen in Remark 25, w.l.o.g. we may assume that the 
former GCIs have the form (T E C’,n). 

Like for the crisp and analytical fuzzy table case, to cope with the non- 
termination problem, a notion of blocking has to be introduced, which is 
the same as for the crisp case: we say that a node v is directly blocked 
iff none of its ancestors are blocked and there exists an ancestor w such 
that 


In this case we say that w directly blocks v. A node v is indirectly blocked iff 
one of its ancestors is blocked. Finally a node v is blocked iff it is not a root 
node and it is either directly or indirectly blocked. 


10 Axioms of the form A=C are replaced with ALC and CLA. 
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TABLE 10.12: The OR-based tableau rules for fuzzy ALC with GCIs. 


(var). For variable t,.c occurring in CF add 2y:c € [0,1] to Cr. For variable 
L(y,w):R occurring in Cz add 2(,w):r € [0,1] to Cr. 

. If Le L(v) then CF := CF U {zv:1 =0). 

. If T € L(v) then CF := CF U {2.7 = 1}. 

. If ~A € L(v) then add A to L(v), and CF := CF U {ay.4 < 1 — Lua). 

. If (i) CLC, € L(v), (ii) the rule has not been already applied to this 
concept, and (iii) node v is not indirectly blocked, then add Cı and C2 
to L(v), and CF := CF U {£v:0; Q Lv:C, > Zv:CınC }- 

(U). If (i) Ci U C2 € L(v) and (ii) the rule has not been already applied to 

this concept, and (iii) node v is not indirectly blocked, then add Cı and 
Ca to L(v), and Cr = Cr U ten ® Ty:Ca 2 Ly:CyUCe t- 

(V). If (1) VR.C € Liv), Re L((v, w)) and (îi) the rule has not been already 
applied to this concept, and (iii) node v is not indirectly blocked, then 
add C to L(w), and CF := CF U ftua > lu vr.C 8 Lw w)RI 

(3). If (i) 3R.C € L(v) and (ii) the rule has not been already applied to this 
concept, and (iii) node v is not blocked then create a new node w, add R 
to L((v, w)), add C to £(w), and CF := CFU{ £ww): RI Tw:C > Lv:3R.C)- 

(E). If (i) (TED nm) €T, (îi) vis a node to which this rule has not yet been 
applied, and (iii) node v is not indirectly blocked, then add D to £(v) 
and set Cr := CF U {Xu:p = n}. 

(E). If (i) CED e T, (îi) v is a node to which this rule has not yet been 
applied, and (iii) node v is not indirectly blocked, then add nnf(7=C) 
and D to £(v) and set Cr := CF U {£v:nnf( 00) Di Lu:p > 1}. 


Now, the calculus is as described above except that the rules are not applied 
to blocked nodes. Specifically, the rules are described in Table 10.1211. 
As before, it can be shown that using the rules in Table 10.12 


Proposition 81. For each knowledge base K = (T, A) 


1. the tableau algorithm terminates; 


2. if the expansion rules can be applied to a knowledge base K = (T, A) 


li Note that in the (Č) rule we use the fact that CLD is the same as T E C > D, 
ie, TE =C L D. 
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FIGURE 10.8: OR-based complete completion-forest from fuzzy ALC KB. 


such that they yield a complete completion-forest F such that CF has a 
solution, then K has a (finite) model; 


3. if a knowledge base K = (T, A) has a (finite) model, then the application 
of the expansion rules yields a complete completion-forest for K such that 
CF has a solution. 


In Appendix H we recall the OR-based tableau calculus for fuzzy SHTF,y 
under SFL. 


Example 40. Consider Example 37. A complete completion-forest is as in 
Figure 10.8. 
It can be verified that the constraint set contains 


Za:A 29.7, 


Ta: AUIR.AZ06, 

Ta: A PARA 2 Ya SAURA > 
Ta:A Sl Za—A > 

Tla, x):R Ë Tr:A > TaAR.A > 


Tr: AUIR.AZ206, 
TA PIzARA > UL AURA > 
Te: AS 1 Eg A > 


Sy AU3R.A 206, 
Ty A Ply DRA 2 Uy: AUIRA > 
Ty:A <1- TymA ` 
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It can be verified that from this constraint set one may construct, e.g., a model 
as in Example 37. 


10.6.1.5 A Fuzzy Classification Algorithm 


As for the crisp DL case (see Section 4.4.1.4), we provide here also an ontology 
classification algorithm for the fuzzy AL-family. 

So, let X be the finite set of concept names we want to classify. At first, 
let us fix a t-norm 9 on which the subsumption relationship is based on. 

As in the fuzzy case, the subsumption relationship is graded, we are only 
interested in the case where the subsumption relationship is non-zero and 
maximal, 1.e., we consider a subsumption relationship of degree n among 
atoms A and B if n = bed(K,A E B) € (0,1]. So, let <"C X x X be 
such a graded subsumption relation among concept names, where A <” B iff 
n = bed(K, AE B) € (0, 1). 

The order is reflexive as for any a € X 


a<ta. 


The order is also max-® transitive (or simply transitive), i.e., 


a < c, where k = max{n 8 m | Ib € X st. a <” b <™ c}. 


Therefore, we call <” also a pre-order. Note that max-® transitivity ensures 
that we cannot have a <” b and a <™ b with n 4 m. Therefore, < can also 
be seen as a function X x X — (0,1]. Also, as for the classical case, <” is not 
antisymmetric and, thus, is not a partial order. Specifically, we may well have 
a <” band b <™ a with both a 4 band n Æ m. 

We call a,b € X equivalent (denoted a = b) iff both a <! band b <! a 
hold. We call a,b € X quasi-equivalent (denoted a =", b) iff both a <” b and 
b <™ a hold. Of course, equivalent concepts are quasi-equivalent (n = m = 1). 


Remark 34. Note that if a <™ ag <™ ... ST ap Sr aj then not 
necessarily all the a; are pairwise quasi-equivalent. In fact, consider ay <™ 
ag <" az <" a. Then one may think that aa = Gu: az, but this is true only 
if n2 9 nz #0, which is not always the case (e.g.,, ar SU! ag <°? az <03 ay 
and Lukasiewicz t-norm). However, if ® is strict then all the a; are pairwise 
quasi-equivalent. 


With eq(a) we denote the set of equivalent objects to a without a itself, i.e., 
eq(a) =4bla=b,a Hd). 
while with [a] we denote the equivalence class of a, i.e., 
la] = {bla =b}. 


Of course, eq(a) = [a] \ {a} and [a] = [b] if a = b. 
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Remark 35. Note thatc<” a iffc <” x for all x € [a] and, similarly, a <” c 
iff x <” e for all x € [a]. 


With qe(a) we denote the set of quasi-equivalent objects to a without a itself, 
1.€., 


ge(a) = {b | a =}, ba Hb). 


Remark 36. Note that b,c € qe(a) does not imply that b and c are quasi- 
equivalent. However, in general b € qe(a) iffa € qe(b). But, if ® is strict then 
b and c are quasi-equivalent (by Remark 34). 


Remark 37. Note that ifc £” a then it is still possible that c <™ x for some 
x € ge(a) and, similarly, ifa £” c then it is still possible that x <™ c for some 
x € qe(a). However, if ® is strict then c £” a iff c L” x for all x € qela) and 
similarly, a £” c then x <™ c for all x € qe(a) (the cases c <” a anda <” c 
are similar). 


Given the previous observations, we are going to represent <() via a possibly 
cyclic directed graph, where in a node we collect the equivalent concepts for 
a representative concept a, as for the classical case. There are cycles in the 
graph if quasi-equivalent concepts exist. 

We call <( a precedence relation of <(), iff «© is the smallest relation 
such that 


1. its reflexive, transitive closure is <C), except for pairs of equivalent ob- 
jects, i.e., pairs of objects a,b such that a = b; 


2. for quasi-equivalent concepts a,b, if a =", b then a <” b and b <” a. 


Note that if x <* y then x <* y and there is no z ¢ qe(x) U ge(y) such that 
x <” z <™ y for some n,m € (0, 1]. 

~< is intended to represent the concept name relationships in a TBox 7, 
which we want to compute. If x <” y then z is a successor of y and y is a 
predecessor of x. Similarly, if x <” y then x is an immediate successor of y 
and y is an immediate predecessor of x. 

A concept name hierarchy graph <C) is represented as a directed, labelled 
graph, where 


e there is a node a for an equivalence class [a], a € X, such that there are 
not two nodes a,b for equivalent concepts a and b; 


e a node a is labelled with the set eq(a) of logically equivalent concept 
names; 


e there is an edge e from node b to node a, labelled with degree n if a <” b. 


In few words: a node a is a place holder for the equivalence class [a], while an 


edge between a and b represents the subsumption relationship among a and 
b. 
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Note that a <” b iff x <” b for any equivalent x € eq(a). Similarly, a <” b 
iff a <™ x for any equivalent x € eq(b). 


A simple classification algorithm. Our algorithm proceeds similarly as for 
the classical case (see Section 4.4.1.4), but some special care has to be given 
to quasi-equivalent concepts and to cycles. 

So, assume we have determined the precedence relation <O for a set 
X; C X of concept names. Initially, Xo = {L, T} and L <$ T. Consider a 
concept name c € X \ X;. We determine a method to compute the precedence 
relation a, for set X ¡+1 = X; U {c}. To do so, we compute c’s immediate 
predecessors (procedure topFuzzySearch), TX? as from Equation (4.4), and 
immediate successors (procedure bottomFuzzySearch), BX? as from Equa- 


tion (4.5), w.r.t. concept names in X; and determine 20, as 


4 = (S1US2U 83) \ Sa 
where 
Si = ZO 
S2 = fe=”"plpeTX¿) 
S3 = {s~<"c|se€ Bă) 
Sa = (s=<*p|se BXf,p € TX?,s,p not quasi-equivalent, 


s=<”c<” pandk=n8m). 


As for the classical case, there is a special case if c is equivalent to some 
ze Xj, ie, c <! x and x <! c. In that case, TXf = {x} dictating c <” x. In 
this case, if n = 1 we test also if x <! c. If the test is positive, c is equivalent 
to x and we add c to eq(x). 

The procedure SimpleFuzzyClassify(K) is as follows: 


SimpleFuzzy Classify (KX) 
1. Let 


(a) X be the set of concept names in T 


(b) Xo := {L,T}. Create two nodes L and T with empty label, 
and add an edge dictating 1 <! T 


If (X \ X;) =0 return (X;, <;) and exit 
Select ce X \ X; 
Compute T X$ 


If TX? = {x} and c <t z, then test x <! c and if test is positive 
add c to eq(x), and go to step 2. 


6. Compute BXf 


FL 20 O 
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7. Set 
Xi+1 = Xi U {c} 
4, = (S/US¿US3) Sa 
where 

Sy = =) 

Sa = fe=”"plpeTX¿) 

S3 = {s<"c|se BxX;} 

Sa = {s~¥p|s € BXf,p € TXf,s,p not quasi-equivalent, 


s=<"c<” pandk=n8m). 


Note that c <” p is given by the value n := SubsFlag(c, p), 
where the variable SubsFlag(.,.) is updated in the procedure 
enhancedFuzzyT opSubs. 


8. Increment ¿ and go to step 2. 


We next show how to compute T'Xf and BXf in the fuzzy setting. We first 
address T.X°. The other case is dual. 

To start with, we assume that the elements in X; are represented as a 
concept hierarchy, as explained previously. To determine T7'Xf, our procedure, 
called topFuzzySearch, starts with T € X; and visits the concept hierarchy X; 
in a top-down breadth-first fashion. That is, 


TX; :=topFuzzySearch(c, T) . 


Some care has to be given to cycles to avoid visiting the same node 
twice. 


For each concept name x € X; under consideration, it determines whether x 
has an immediate successor y satisfying c <” y. If there are such successors, 
they are considered as well. Otherwise, x is added to the result list of the 
topSearch algorithm. 

To avoid multiple visits of elements of X; and multiple comparisons of 
the same element c, topFuzzySearch employs the label “visited” and another 
label n € [0,1] if the subsumption test has been made and the degree of 
subsumption is n. 

The procedure topFuzzySearch gets two concepts as input, the concept 
c, which has to be inserted and the element x € X; currently under con- 
sideration. For x, we already know that c <” x and we look at direct suc- 
cessors of x w.r.t. a For each direct successor y of x, we have to check 
to which degree n, € [0,1], y subsumes c. This is done with the procedure 
enhancedF'uzzyT opSubs. The direct successors y for which the test was pos- 
itive (ny > 0) are collected in a list PosSucc. If the list remains empty, x is 
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added to the result list; otherwise topFuzzySearch is called for each positive 
successor, if not already visited. 
The topFuzzySearch(c, x) algorithm is as follows: 

topFuzzySearch(c,x) 

1. Visited(c, x):= true 

2. For all y with y < x do: if enhancedFuzzyTopSubs(y, c) then 

PosSucc := PosSuccU {y} 
3. If PosSucc = Ú then Result := {x} and go to step 5. 


4. For all y € PosSucc do: if not Visited(c, y) then Result := ResultU 
topFuzzySearch(c, y) 


5. Return Result. 
The enhancedFuzzyTopSubs(y, c) algorithm is as follows: 
enhancedFuzzyTopSubs(y,c) 


1. If SubsFlag(y,c) > 0 then Result := true and go to step 6. 
2. If SubsFlag(y,c) = 0 then Result := false and go to step 6. 
3. If Q is a strict t-norm then 


e iffor all z with y <; ” zit holds that enhancedFuzzyT opSubs(z, c) 
and c <” y then SubsFlag(y,c) := n, Result := true and go 
to step 6. 


4. If Q is not a strict t-norm and c <” y then SubsFlag(y,c) := 
n, Result := true and go to step 6. 


5. SubsFlag(y, c) := 0, Result := false 
6. Return Result. 


Note that in step 3, before testing the subsumption c <” y we apply the 
following heuristics using negative information [20, 22, 23], which works for 
strict t-norms only: 


If Q is a strict t-norm and if for some predecessor z of y the test 
c <™ z has failed, we can conclude that c £” y without performing 
the expensive test c <” y. To gain maximum advantage, all pre- 
decessor of y should have been tested before the test is performed 
on y, which is obtained by recursive calls. 


We next show how to compute BX’, which is dual to the topFuzzySearch 
procedure and is performed by the bottomFuzzySearch algorithm. We have 
that 

BX; := bottomFuzzySearch(c, L) 


bottomFuzzySearch(c,x) 
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1. Visited(c, x):= true 

2. For all y with x <? y do: if enhancedFuzzyBottomSubs(c, y) then 
PosPrec := PosPrecU {y} 

3. If PosPrec =) then Result := {x} and go to step 5. 


4. For all y € PosPrec do: if not Visited(c, x) then Result := ResultU 
bottomFuzzySearch(c, y) 


5. Return Result. 
The enhancedFuzzyBottomSubs(c, y) algorithm is as follows: 


enhancedFuzzyBottomSubs(c,y) 


1. if SubsFlag(c, y) > 0 then Result := true and go to step 6. 
2. If SubsFlag(c, y) = 0 then Result := false and go to step 6. 
3. If Q is a strict t-norm then 


e Iffor all z with z =; 2 y it holds that enhancedFuzzyBottomSubs(c, z) 
and y <” c then SubsFlag(c, y) := n, Result := true and go 
to step 6. 


4. If Q is not a strict t-norm and y <” c then SubsFlag(c, y) := 
n, Result := true and go to step 6. 


5. SubsFlag(c, y) := 0, Result := false 
6. Return Result. 


Note that in step 3, before testing the subsumption y <” c we apply the 
following heuristics using negative information [20, 22, 23], which works for 
strict t-norms only: 


If & is a strict t-norm and if for some successor z of y the test z <” 
c has failed, we can conclude that y £” c without performing the 
expensive test y <” c. To gain maximum advantage, all successors 
of y should have been tested before the test is performed on y, 
which is obtained by recursive calls. 


Example 41. Consider a fuzzy analogue of Example 13 under product logic: 


K = {(C CAR.D,0.7), GRE E,0.8)} . 
Figure 10.9 illustrates a classification run. 
Optimising fuzzy classification via told subsumers. We extend the no- 


tion of a told subsumer introduced in Section 4.4.1.4 to the fuzzy case. Con- 
sider a KB K = (T, A). 


1. If 7 contains (A E C,m) then C is called a told subsumer of A to degree 
m, denoted A >? C. 
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FIGURE 10.9: A fuzzy DL classification run. 


2. If T contains A = C then C is called a told subsumer of A to degree 1, 
denoted A >}, C. 

3. H(DEC¡M...MC,,,m) € T then for all i, D >% Ci. 

4. HA=C,M...MC, € T then for all i, A >!, Ci. 

5. If (Ci UU... UC, E D,m) € T then for all i, Ci > D. 

6. If A=C,U...UC, € T then for all i, Ci >}, A. 

T. ts is max —®-transitive. 


We say that B is a told subsumer of A iff A >?, B with n > 0. Note that if 
® is strict and A >}, B then n > 0 and, thus, B is a told subsumer of A. 


Remark 38. Note that if A >}, B withn > 0 we know that K = (AE B,n). 
Unfortunately, n is not necessarily the maximal degree of subsumption, e.g., 


for 


(AC B,0.5) 
(AE C1) 
(CHDEB,1) 
D=T 

we have that A =>}? B, though K F (AE B,1). 


Now, the analogous of Remark 7 is not true. 


Remark 39. Before classifying concept A, all of its told subsumers which have 
already been classified, and all their subsumers, can be marked as subsumers 
of A, though the maximal degree of subsumption may not be known, without 
the subsumption test. That is: 


if A =>. B with n > 0 then we know that A<” B with m > n. 
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However, the analogous of Remark 8 is true. 


Remark 40. In procedure SimpleFuzzyClassify, in step 3, select concept 
cE X \ X; only if all of its told subsumer have been classified. That is, 


if A >ù, B with n > 0 then classify B before A 


Therefore, we get a preference order on the atoms in X. 


Fuzzy acyclic KBs case. This is like the crisp case: consider an acyclic KB 
K = (T, A). For acyclic KBs we may take advantage of the heuristics devel- 
oped in the previous paragraph concerning told subsumers and the following 
remark that works for acyclic KBs only. 


Remark 41. Before classifying concept A, all of its told subsumers which have 
already been classified, and all their subsumers, can be marked as subsumers 
of A. Subsumption test for these atoms are, thus, unnecessary. That is, in the 
procedure SimpleFuzzyClassify, we have another initialization step: 


if A >p B with n > 0 then set SubsFlag(B, A) =n. 
So, for acyclic KBs if A >f, B with n > 0 then A <” B holds. 


Specifically, apply the SimpleFuzzyClassify procedure with the two heuris- 
tics described in Remark 41 and 40. Then the ordering induced by Remark 40 
can be exploited by omitting the bottom search phase for primitive concept 
names and assuming that they only subsume (concepts equivalent) to L. This 
is possible because, with an acyclic KB, a primitive concept can only subsume 
concepts for which it is a told subsumer. Therefore, as concepts are classified 
in definition order, a primitive concept will always be classified before any of 
the concepts it subsumes (and, thus, the bottom search phase is redundant). 


10.6.2 The Case of Fuzzy EL 


We describe here a polynomial fuzzy classification algorithm for fuzzy ELT 
(see Section 4.2.2.4 for the definition of crisp ELT) under Gödel logic [388, 
475], which is the only reasoning algorithm within the EL family known so 
far, though it is expected that fuzzy calculi involving different t-norms and/or 
fuzzy calculi in the style of [229, 230, 243, 244, 245, 297] (see Remark 9) will 
appear in the near future. 

So, in EL* concepts have the form as for EL, namely 


CID > A | (atomic concept) 
T | (universal concept) 
CND | (concept conjunction) 
3R.C (qualified existential restriction) 


General inclusion axioms are of the form (C E D,n), where C,D are EL? 
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TABLE 10.13: Normalization rules for fuzzy €L* TBoxes. 


NF1 Rio: oR, E S> {Ri o---o Rk-1 CU,U OR, CS} 


ARC E Dyn) > (CE A,n), ARAC D,n)} 


BCAR.C,n) > ((B E 3R.A,n), (AE C,n)} 


( 
( 
NF4 (ĈE D,n) 4 (CE A,n), (AC D,n)) 
( 
( 


BECNODna->(BEC,BED nm) 


where C,D g CN}, BE CN? U new role name, and A new concept name. 
T Ti 


concepts, while role inclusion axioms are of the form R,---R, E R, where 
R; and R are role names (note that the degree in role inclusions axioms is 1, 
i.e., role inclusion axioms are classical). 

Now, before applying the polynomial algorithm for classification, a fuzzy 
ELT TBox needs to be normalized [15, 25] in a similar way as we did for EL ++ 
in Section 4.4.2 (see also [388]). Specifically, let CNF (resp. CN7) denote the 
set of concept names occurring in 7 with (resp. without) the top concept T. 
We say that a fuzzy EL* Tbox T is normalized if 


1. it only contains GCIs of the following form: 


(AN... MAE Bin), (ACAR.B,n), (AR.ACB,n), 


? 


where A, A; € CN} and B € CNT. 


2. all role inclusions are of the form RE S or Rio RoE S. 


As shown in [388], ant fuzzy EL* TBox T can be turned in linear time into a 
satisfiability preserving normalized TBox 7” by exhaustively applying proper 
normalization rules (see Table 10.13), which introduce new concept and role 
names in the TBox. 

As next, we assume that a TBox is normalized. Similarly to crisp EL**, 
when developing the subsumption algorithm for normalized €£*+ TBoxes, we 
can restrict our attention to subsumption between concept names. In fact, 7 = 
(CE D,n) iff T' E (AC B,n), where T' = TU{(AL Cn), (DE B,n)}, 
with A, B new concept names. Now, let Ry to denote the set of all role names 
used in 7. 

The algorithm computes a mapping S from CN7 to a subset of CN} x 
(0, 1], and a mapping R from Ry to a subset of CN} x CN} x [0, 1]. The intu- 
ition is that these mappings make implicit subsumption relationships explicit 
in the following sense: 


AMADOR nC. E C,n) > {(ČE A,n), (010... A... Cee nit 
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TABLE 10.14: Completion rules for fuzzy EL*. 


(CRI or 1 < i < k, (Anni) € S(X), (Ain... A, E B,n) € T, and 


) Iff 
(B,m) £ S(X), where m = min(n;,...nz,n) then add (B, m) to S(X); 
) 1 


(CR2 


f (A,n) e S(X), (AL AP.B,k) € T, and (X,B,m) ¢ R(P), where 
m = ani ali k) then add (X, B,m) to R(P); 


(CR3) E (X,Y ni) € R(P),(A,n2) € SM), EAP.AE B,n3) € T, and 


) 
,m) € S(X), where m = min(n;,n2,n3) then add (B, m) to S(X); 
(CRA) If (X,Y,n) € R(P),P E Se T, and (X,Y,n) ¢ R(S) then add 
X,Y,n) to R(S); 

I 


(B 

( 

(CR5) f (X, Y, n1) € R(R1), (Y, Z, n2) € R(Ra), Ri o ho E R3 € T, and 
(X,Z,m) £ R(R3), where m = min(n,,na2) then add (X,Z,m) to 
R(R3). 


e (B,n) € S(A) implies that 7 E (AE B,n); 


e (4,B,n) € R(P) implies T — (A E 3P.B,n). 


These mapping are initialized as follows: 
e S(A) :=((4,1),(T1,1)) for each A € CN7; 
e R(P) :=0 for each Pe R7. 


Then, the sets S(A) and R(P) are extended by applying the completion rules 
shown in Table 10.14 until no more rules are applied. 

Now, it can be shown that by appropriately optimizing the application of 
the completion rules (see [15, 25, 388]) 


Proposition 82 ([388]). For a normalized EL* TBozx T, 


1. the rules of Table 10.14 can only be applied a polynomial number of 
times, and each rule application is polynomial; 


2. Let S be the mapping obtained after the application of the rules of Ta- 
ble 10.14 to T has terminated, and let A, B be concept names occurring 
in T, then for n € (0,1] we have that T = (AE B,n) iff (B,n’) € S(A) 


for some n! > n. 


Example 42. Let us consider K in Example 41 under Godel logic. 
Note that K is already normalized. The following illustrates a classification 
TUN. 
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FIGURE 10.10: A fuzzy EL ** classification example. 


0 


1. Initialization: 


S(C) T UC, 1), (T, 1 ) 
S(D) := {(D,1),(T, 
S(E) = UE, 1), (T, ) 
R(R) = 0. 


2. Application of rule (C R2): 
R(R) := R(R) U {(C, D,0.7)} . 
3. Application of rule (C R3): 


S(C) := S(C) U {(E,0.7)} . 


4. Stop. Figure 10.10 illustrates the final classification, which coincides with 
the one in Figure 10.9, except to the subsumption degree among E and 


C. 


10.6.3 The Case of Fuzzy DL-Lite 


In Section 4.2.2.5 we described the crisp DL-Lite family, while in Sec- 
tion 4.4.3 we described a conjunctive query answering procedure for crisp 
DL-Litecore. As query answering is the major reasoning task of the DL- 
Lite family, we do so here the same and describe indeed a top-k query an- 
swering procedure of DL-Litecore under Gödel logic with involute negation 
and which find its application in [344, 99, 420, 430, 431, 432, 435] (see, 


Fuzzy OWL 229 


e.g., [289, 342, 343, 404, 410, 424, 434] for similar reasoning algorithms re- 
lated to the DL-Lite family). 

So, we recall that in fuzzy DL-Litecope concepts and roles have the following 
syntax 


B — A|AR 
C — B|-B 
R — P|. 


Fuzzy inclusion axioms are classical, i.e., of the form 


BEC 


and fuzzy assertion axioms are, as usual of the form (a:A,n) and ((a, b):P,n). 
As for the crisp case (see Section 4.4.3), we have a normalization step. 


Knowledge Base normalisation. That is, the normalisation of K = (T, A) 
is obtained by transforming K as follows. The ABox A is expanded by adding 
to A the assertions (a:3R,n) and (b:3R7, n), for each ((a,b):R,n) € A. From 
A we remove all fuzzy assertions (a:B,n) if there is another (a:B,m) in A with 
degree m > n. We do the same for fuzzy role assertions. Then 7 is expanded 
by closing it with respect to the following inference rule: if By E By € T and 
either Bə E 5B3 € T or B3 E Bo e T then add Bı E ~B; to T. It can be 
shown that after computing the above closure we have that 7 = Bı E Ba 
iff either Bı E =B2€ T or Bz CAB, ET. 

Now we store A in a relational database. That is, (1) for each basic concept 
B occurring in A, we define a relational table tabg of arity 2, such that 
(a,n) € tab g iff (a:B,n) € A; and (ii) for each role P occurring in A, we define 
a relational table tabp of arity 3, such that (a,b, n) € tabp iff ((a,b):P,n) € A. 
We denote with DB(A) the relational database thus constructed. 


Knowledge Base satisfiability. Before we start to query the KB, once and 
for all, we check whether the KB is satisfiable. To check the satisfiability 
of a normalized KB K = (7,A), we verify the following condition: there 
exists Bı E 5B2 € T and a constant a such [(a:B,,n),(a:B2,m)) C A with 
n > 1— m. If this condition above holds, then X is not satisfiable. Otherwise, 
K is satisfiable. 

Note that the algorithm can verify this condition by posing to DB(A) a 
simple fuzzy conjunctive query expressed in SQL query, i.e., K is not satisfiable 
iff 


q(x) — tabg, (x, $1), tabs, (£, $2), si > 1— s2 


has a non-empty answer in DB(A). As for the crisp case, it can be shown 
that the above described algorithm decides the KB satisfiability problem in 
polynomial time. 


Top-k Query answering. To determine the answers of a top-k conjunctive 
query over K is essentially similar to the case of the crisp case. To this end: 
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1. We have to check if X is satisfiable, as querying a non-satisfiable KB is 
undefined in our case. 


2. By considering 7 only, the user query q is reformulated into a set of 
fuzzy conjunctive queries r(q, 7). 


3. The reformulated queries in r(q,T) are then evaluated over A only 
(which is stored in a database), producing the requested answer set 
ans(K, q). 


Now, let us consider a fuzzy conjunctive query without ranking aggregation 
functions. The procedure is similar to the crisp case. Specifically, we have the 
following. An axiom 7 is applicable to an atom (B(x), s) in a query body, if 
7 has B in its right-hand side, and 7 is applicable to an atom (P(x1, 22), s) 
in a query body, if either (i) z2 = - and the right-hand side of r is IP, or 
(ii) x, = - and the right-hand side of 7 is JP~. We indicate with gr(g;7) 
the expression obtained from the atom g by applying the inclusion axiom 7. 
Formally, if g = (Bi (x), s} (resp., g = (P,(x,-),s) or g = (Pi(.,2),s)) and 
T = B E Bi (resp., 7 = B2 E 3P, or T = B2 E JP; ), we have: 


e gr(g, T) = (A(x), s}, if B2 = A, where A is an atomic concept; 


© gr(g,T) = (Po(z, -); 5), if Bə = 3P»; 


bs gr(g,T) = (P3(=, £), 8), if Ba A APs. : 


Now, the query reformulation algorithm, that given a conjunctive query q and 
a set of axioms 7, reformulates q in terms of a set of conjunctive queries 
r(q,7), which then can be evaluated over DB(A) is exactly as for the crisp 
case (see Algorithm 1) and we have that 


ans (K,q) = Top, {(t, A) | q; Er(q,T), AF (qilt), A)} - 


The above property dictates that the set of reformulated queries q; € r(q, T) 
can be used to find the top-k answers, by evaluating them over the set of 
instances A only, i.e., over the database, without referring to the ontology T 
anymore. As for the crisp case, note, however, that the size of r(q, T) may be 
exponential w.r.t. T. 

In the following, we show how to find the top-k answers of the union of 
the answer sets of conjunctive queries q; € r(q, T). 

A naive solution would be: we compute for all q, € r(q,7) the whole 
answer set ans(q;, A) = {(t, A) | A H (a(t), A)}, then we compute the union, 
Ug.er(a.7) ans(q;, T), of these answer sets, order it in descending order of 
the scores and then we take the top-k tuples. We note that each conjunctive 
query q; € r(q, T) can easily be transformed into a top-k SQL query expressed 
over DB(A), i.e., the database encoding A. The transformation is conceptually 
simple. 
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Example 43 ([410]). Let us consider Example 15. We may ask to find cheap 
accommodations, their location, and price. Such a query may be expressed as 


(q(a, hl, p), s) <-Accomodation(a), HasHLoc(a, hl), 
HasHPrice(a, p), s := 1s(50,80)(p) . 


Then the following is the set of query rewritings r(q, T): 


(q(a,hl,p),s) <= Accomodation(a),HasHLoc(a, hl), 
HasHPrice(a, p), s := ls(50,80)(p) 


(q(a,hl,p),s) + Hotel(a),HasHLoc(a, hl), 
HasHPrice(a, p), s := ls(50, 80) (p) 


(q(a,hl,p),s) +  Hotelx(a), HasHLoc(a, hl), 
HasHPrice(a, p), s := ls(50, 80) (p) 


(q(a,hl,p),s) <  Hotelxxxxx(a), HasHLoc(a, hl), 
HasHPrice(a, p), s := ls(50, 80) (p) . 


A major drawback of this solution is the fact that there might be too many 
tuples with non-zero score and hence for any query q; € r(q,O), all these 
scores should be computed and the tuples should be retrieved. This may not 
be feasible in practice [424]. Indeed, in [424] it is shown that in some cases 
this approach didn’t work at all due to main memory problems and motivated 
the algorithm described next, called Disjunctive Threshold Algorithm, initially 
proposed in [410], that does address the case for top-k join queries (queries 
without ranking aggregates) and then has been extended to the case of queries 
with ranking aggregates in [424]. 


The DTA for top-k join queries. An immediate method to compute 
ans (KC, q) for top-k join queries is to compute for all q; € r(q, T), the top-k 
answers ans (A, qi). If both k and the number, ng = |r(q, T)|, of reformulated 
queries are reasonable, then we may compute the union, 


U(q,K)= |] ansi(A,q), 


qiEr(q,T) 


of these top-k answer sets, order it in descending order w.r.t. score and then 
we take the top-k tuples. 

As an alternative, we can avoid computing the whole union U (q, XK), so 
further improving the answering procedure, by relying on a disjunctive vari- 
ant [410] of the so-called Threshold Algorithm (TA) [152], called Disjunctive 
TA (DTA). 
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We recall that the TA has been developed to compute the top-k answers 
of a conjunctive query without joins and with monotone score combination 
function. In the following we show that we can use the same principles of the 
TA to compute the top-k answers of the union of conjunctive queries without 
ranking aggregates: 


1. First, we compute for all q, € r(q, T), the top-k answers ansg(A, qi), us- 
ing top-k rank-based relational database engine. Now, let us assume that 
the tuples in the top-k answer set ans;(A,q:) are sorted in decreasing 
order with respect to the score. 


2. Then we process each top-k answer set ansz(A,qi) (q; € r(q,T)) ac- 
cording to some criteria (e.g., in parallel, or alternating fashion, or by 
selecting the next tuple from the answer set with highest threshold 9, de- 
fined below), and top-down (i.e., the higher scored tuples in ans, (A, qi) 
are processed before the lower scored tuples in ans, (A, q;)). 


(a) For each processed tuple t, if its score is one of the k highest we 
have already computed, then remember tuple t and its score s 
(ties are broken arbitrarily, so that only k tuples and their scores 
need to be remembered at any time). 


(b) For each answer set ans(A,q,), let 6; be the score of the last tuple 
processed in this set. Define the threshold value 8 to be 


0 = max(91,...,0,) - 


(c) As soon as at least k tuples have been processed whose score is at 
least equal to 6, then halt (indeed, any successive retrieved tuple 
will have score < 0). 


(d) Let Y be the set containing the k tuples that have been processed 
with the highest scores. The output is then the set {(t, s+) | t € Y]. 
This set is ansz(K, q). 


It is not difficult to see that the DTA determines the top-k answers. Indeed, 
if at least k tuples have been processed whose score is at least equal to 0 then 
any new not yet processed tuple t will have score bounded by @ and, thus, it 
cannot make it into the top-k. Hence, we can stop and the top-k tuples are 
among those already processed. 


Example 44 ([424]). Suppose we are interested in retrieving the top-3 an- 
swers of the disjunctive query q that has been rewritten as {q3,q4,q"}. We 
have seen that it suffices to find the top-3 answers of the union of the an- 
swers to q3,q4 and to q”. Let us show how the DTA works. First, we submit 
q3,q4 and q” to a rank-based relational database engine, to compute the top-3 
answers. Let us assume that 


ans3lF,q3) =  [(0,1.0), (3,0.7), (4, 0.6)] 
anss(F,q4) = [(1,0.9), (2, 0.8), (5, 0.5)] 
anss(F, q") = [(2,0.84), (3, 0.64), (4, 0.36)] 
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The lists are in descending order w.r.t. the score from left to right. Now we 
process alternatively ans (F, q3), then ansk(F, q4) and then ansk(F, q") in 
decreasing order of the score. The table below summarizes the execution of our 
DTA algorithm. The ranked list column contains the list of tuples processed. 


Step | Tuple | Oas Oa Oar 0 Ranked List 
1 (0,1.0) [| 1.0 - - - (0, 1.0) 
2 (1,0.9) [| 1.0 09 - - (0, 1.0), (1, 0.9) 
3 | (2,0.84) | 1.0 0.9 0.84 | 1.0 | (0,1.0), (1,0.9), (2, 0.84) 
4 3,0.7 0.7 0.9 0.84 | 0.9 | (0,1.0), (1,0.9), (2, 0.84) (3, 0.7 
5 (2, 0.8) | 0.7 0.8 0.84 | 0.84 | (0,1.0),(1,0.9), (2, 0.84) (3, 0.7) 


At step 5 we stop as the ranked list already contains three tuples 
above the threshold 6 = 0.84. So, the final output is ansk(F,q3) = 
[(0, 1.0), (1, 0.9), (2, 0.84)]. Note that not all tuples have been processed. 


Proposition 83 ([410, 424]). Given K = (4,7) and a conjunctive query 
q without ranking aggregates, then the DTA determines ansk(K, q) in linear 
time w.r.t. the size of A. 


Proposition 83 extends also to the case of disjunctive queries, to DL-Litez n, 
DL-Liter np, DLR-Lites n and DLR-Liter n [424]. 

We remark that in case a ranking aggregate occurs in a query, the DTA has 
to be modified as illustrated in [424]. We do not report it here and refer the 
reader to [424] for an in-depth description and its experimentation. We just 
recall that Proposition 83 extends also to disjunctive queries, to DL-Litez,n, 
DL-Liter n, DLR-Lites n, and DLR-Liter n, in which a ranking aggregate 
@ e (MIN, AVG, SUM, MAX} may occur in a query. 


10.6.4 The Case of Fuzzy Horn-DLs 


In Section 4.4.4, we have seen that reasoning in Horn-DLs may be per- 
formed by a reduction to logic programming. We will show here that this is 
indeed the case as well as for fuzzy Horn-DLs under Gédel logic (for other 
logics the transformation still needs to be worked out). Specifically, we show 
that a fuzzy Horn-DL KB K can be reduced to a reasoning preserving fuzzy 
logic program Px (fuzzy logic programs are described in Chapter 11). 

So, in the following consider a fuzzy Horn-DL based on the Gödel t-norm 
8 and let K = (T, A) be a KB in which concepts have the form 


B => A|B, Bə | Bı U Bə | 3R.B 
R => P|P- 


where for n € (0, 1] inclusion axioms have the form 


(BEC,n) 
(Ry E Ra,n) 
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and fuzzy assertions are of the form (a:C,n), ((a, b):R,n). 

Our mapping to fuzzy logic programming follows close the one described 
in Section 4.4.4. 

We anticipate informally (see Chapter 11), that a fact in fuzzy logic pro- 
gramming is of the form 

P(C1,...,Cpn) EN, 

where p is an n-ary relation, every c; is a constant, and n > 0 is a rational 
in [0, 1]g (meaning that the degree of truth of p(c1,..., Cn) is no less than n), 
while a fuzzy rule (to what concerns us here) is of the form 


P(x) — f(prlz1),---,Pnl[Zx)) , 
where f is a k-ary monotone function f : [0, 1]*>1%1, whose meaning is 


“if n; is the degree of truth of p;(z;) then the degree of truth of 
p(x) is no less than f(n1,..., ng)”. 


We now define a recursive mapping function o which takes a inclusion axioms 
and assertions and maps it into the following expressions: 


o((Ri¡ E R2,n)) > Orote( Re, 2,Y) E Crote(Ri, 2, y) @n 
Orote(R, x,y) = R(z,y) 
o,(R-,2,y) => Rly,x) 
o((BEC,n)) > orn(C,2) oy (B)On 
on(A,x) > Alu) 
on(C11C2,2) > on(Ci, x) Ao. (Ca, x) 
on YR.C, £) > on(C,x) E Orole(R, x,y) 
ol A, £) => A(z) 
o(Cı NC2,£) => lC, 2) 8 (C2, £) 
(Cı U C2,£) => lOi, 2) 9 lC, x) 
oy(AR.C,2) > Grote(R, £, y) ® (C, y) 
olla:C,n)) = oní[Cja) =n 
o(((a,b):R,n)) => Grole(R,a, b) n 


where y is a new variable. 
We then transform the above generated expressions into rules by applying 
recursively the following mapping: 


or((H A H') + B) > o (H+<B),0,(H' + B) 
o,((H + H')+ B) > o (H<(BAH”) 
o,(H s= (Bı V B2)) = or (H ss Bı), o, (H —= Ba) 
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Eventually, if none of the above three rules can be applied then 


o(H+B) >» H&B. 


For instance, the GCI 


(AN3R.CEBNVP.D,n) 
is first mapped into the expression (via 0) 
B(x) A (D(x) € P(a,z)) — A(x) 8 R(x,y) o Cly) 8 n 
that then is transformed into the two rules (via 0) 


B(x) + A(x) @ R(x, y)@Cly) 9 n 
D(z) «© A(x) ® R(x, y) @Cly) 9 P(a,z) @n. 


It can be shown that 


Proposition 84. The above described transformation preserves semantic 
equivalence. That is, let K be a fuzzy Horn-DL KB and Px be the rule set 
that results from applying the transformation to all axioms in K, then Px is 
logically equivalent to K w.r.t. the semantics of FOL Px has the same set of 
models and entailed conclusions as K. 


Hence, reasoning in fuzzy Horn-DL can be reduced to reasoning within fuzzy 
logic programming (see Chapter 11). 


10.6.5 The Case of Concrete Domains and Aggregation 
Operators 


A major ingredient of fuzzy DLs is the capability to deal with fuzzy con- 
crete domains and aggregation operators (see Section 8.2.3 for the fuzzy propo- 
sitional case), as illustrated in Section 10.3. 

In Section 8.2.3 we have shown that indeed one may reason in the presence 
of concrete domain and aggregation operators by relying on an OR-based de- 
cision procedure, by mapping the constraints imposed by the concrete domain 
and aggregation functions into set of inequations (see Appendix F). It is, thus, 
not surprising that this approach may be followed also in the case of fuzzy 
DLs [48, 51, 53, 58, 59, 398, 399]. We refer the reader to Appendix H.4 for the 
detailed description of the inference rules. 

We also remark that under SFL one may also map fuzzy DLs with concept 
modifiers and concrete fuzzy concepts d € {rs(a, b), tri(a, b,c), trz(a, b,c, d)} 
into crisp fuzzy DLs, as illustrated in [45] and this is the method followed by 
the DELOREAN [42] reasoner. However, it is unknown whether this approach 
can be adapted to aggregation operators. 


Taylor & Francis 
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Chapter 11 


Fuzzy Rule Languages 


11.1 Introduction 


In logic programming, the management of imperfect information has at- 
tracted the attention of many researchers and numerous frameworks have 
been proposed. Addressing all of them is almost impossible, due to both the 
large number of works published in this field (early works date back to early 
80's [376], see also [416]) and the different approaches proposed. Like for the 
DL case, essentially they differ in the underlying notion of uncertainty theory 
and vagueness theory (probability theory, possibilistic logic, fuzzy logic and 
multi-valued logic) and how uncertainty /vagueness values, associated to rules 
and facts, are managed. 

We report below a list of references and the underlying fuzziness and un- 
certainty theory in logic programming frameworks. The list of references is 
by no means intended to be all-inclusive. The author apologizes both to the 
authors and with the readers for all the relevant works, which are not cited 
here. 


Probability theory: [29, 30, 34, 35, 102, 117, 119, 115, 116, 118, 163, 352, 
353, 354, 232, 233, 231, 252, 250, 254, 255, 278, 279, 280, 281, 282, 282, 
283, 310, 322, 327, 328, 329, 330, 349, 350, 455, 462] 


Possibilistic logic: [5, 6, 7, 96, 137, 179, 331] 


Fuzzy set theory: [309, 31, 32, 79, 90, 97, 144, 192, 214, 240, 176, 175, 309, 
324, 323, 346, 363, 375, 376, 377, 298, 436, 451, 458, 456, 459, 460, 470] 


Multi-valued logic: [65, 81, 103, 104, 105, 112, 110, 106, 107, 108, 109 
112, 121, 120, 124, 122, 123, 154, 155, 156, 157, 159, 180, 234, 235, 236, 
239, 247, 249, 251, 253, 266, 267, 268, 269, 270, 271, 272, 273, 274, 300, 
302, 301, 305, 311, 312, 318, 318, 316, 317, 319, 364, 372, 373, 374, 402, 
400, 403, 408, 409, 414, 413, 429, 446] 


While there is a large literature related to the management of fuzziness in 
logic programs, there are frameworks that are general enough to cover a large 
amount of them (see, e.g., [289, 400, 416, 456]). Roughly, rules are of the form 


At f(Bı, wn Bn) 3 
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where A, B; are atoms and f is a total function f : S” > S over a truth 
space S. Computationally, given an assignment /interpretation Z of values to 
the B;, the value of A is computed by stating that A is at least as true as 
f(Z(Bs), .... (Br). 

The form of the rules is sufficiently expressive to encompass many ap- 
proaches to many-valued logic programming. [289] provides an even more 
general setting as the function f may also depend on the variables occurring 
in the rule body. On the other hand there are also some extensions to many- 
valued disjunctive logic programs [311, 312, 403]. In some cases, e.g., [253] 
there is also a function g, which dictates how to aggregate the truth values in 
case an atom is head of several rules. 

Most works deal with logic programs without negation and some may 
provide some technique to answer queries in a top-down manner, as e.g., [104, 
239, 253, 456, 402]. On the other hand, there are very few works dealing with 
normal logic programs [107, 154, 155, 266, 267, 268, 269, 270, 271, 272, 311, 
400, 403, 408, 413, 288], and little is know about top-down query answering 
procedures. The only exceptions are [400, 408, 413]. 

Another rising problem is the problem to compute the top-k ranked an- 
swers to a query, without computing the score of all answers. This allows to 
answer queries such as “find the top-k closest hotels to the conference loca- 
tion.” Solutions to this problem can be found in [289, 409, 414, 428]. 

In the following, we will address the foundational part of rule-based lan- 
guages. 


11.2 Fuzzy Datalog Basics 


As for Mathematical Fuzzy Logic, the main idea is that an atom p(t), 
rather than being interpreted as either true or false, will be mapped into a 
truth value (or score) r in [0, 1]o (examples of other truth spaces can be found 
in, e.g., [103, 250, 402]). 

A knowledge base K = (F,P) consists of a finite facts component F and a 
finite LP component P, which are both defined below. 


Facts Component. F is a finite set of expressions of the form 


(plci. -.,Cn) r}, 


where p is an n-ary relation, every c; is a constant, and r > 0 is a rational in 
[0, 1o. 

The underlying meaning of such an expression is that the degree of truth 
of p(c1,...,Cn) is equal to or greater than r. For each p, we represent the 
facts (p(c1,...,Cn), r} in F by means of a relational n + 1-ary table T,, con- 
taining the records (c1,...,Cn,r). As usual, we assume that there cannot be 
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two records (C1,...,Cn,11) and (C1,...,Cn,72) in Tp with ra < ra (if there are, 
then we remove the one with the lower score). We assume each table sorted in 
descending order with respect to the scores. For ease, we may omit the score 
component and in such cases the value 1 is assumed. 


Rule Component. P is a finite set of rules, which are similar as fuzzy con- 
junctive RDFS and DL queries (See Sections 9.3 and 10.4), and are of the 
form (where s = (51,...,5n), n > 0) 


(p(x), 8) — Iy.(pi(Z1),81),..., (Pn(Zn); $n), =f (S, Z) , (11.1) 
where 
1. pis an n-ary relation, every p; is an n;-ary relation; 
2. x are the distinguished variables; 


3. y are existentially quantified variables called the non-distinguished vari- 
ables; 


4. z and every Z; is a tuple of constants or variables in x or y; 


5. s and elements in s are called scoring terms. Scoring terms are either 
rational numbers in [0, 1]g or variables, called scoring variables; 


6. all variables in x, y, and variables in {s, s1,...,8,} are pairwise disjoint 
and all variables in x occur in at least one of the z;; 


7. f is a scoring function f: [0, 1lo — [0, 1]g, which computes the overall 


score to be assigned to the rule head. We assume that f is monotone, 
that is, for each v, v’ € [0,1], such that v < v’, it holds f(v) < f(v’), 
where (v1,...,01) < (v1,...,v])iffu; < v} for all i. We also assume that 


the computational cost of f is bounded by I. 


Similar to the crisp Datalog case, we call (p(x),s) the head and 
dy.(pi(Z1), $1),---; (Pnl[Zn), sn), $:=f(s,z) the body of the rule; if clear from 
the context, we may omit the existential quantification 3y; as for the crisp 
case, we assume that predicate names in a rule body are distinct, i.e., pi A pj 
for i # j; we also assume that relations occurring in F do not occur in the 
head of rules (so, we do not allow that the fact relations occurring in F can be 
redefined by P); and the relations in F are called extensional relations, while 
the others are intensional relations. We further call s:=f(s, z) a scoring atom 
and an expression (p(z), s) a fuzzy atom. 


Remark 42. In the following we will make the following convention: if a 
scoring term is omitted then the value 1 is assumed and if the scoring term 
in the rule head is a rational r € [0, 1]g, then the scoring atom is of the form 
ri=f(...). For ease of presentation, if there is at least one fuzzy atom in the 
rule body, we may omit a scoring atom of the form r:=r, for r € [0, L]o. 
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An example of fuzzy rule is the following: 


(GoodHotel(x),s) + Hotel(x), (Cheap(x), sa), 
(CloseT oV enue(x), s2), (Comfortable(x), s3), 
s:= 0.3- sı +0.5-s9+0.2- $3 (11.2) 


The intended meaning is to retrieve good hotels, where the degree of goodness 
is a function of the degree of being cheap, close to the venue, and comfortable. 


Remark 43. Note also that a classical LP rule 


p(X) + pi(Z1),...,Pn[Zn) ; 


is nothing else than a special case of fuzzy rules, and, e.g., îs the same as 
writing 
(p(x), 1) — (pr(21),1),-.-, (PnlZn), 1) - 


So, e.g., all rules of Example 17 are fuzzy rules as well. 


Remark 44. In the following, a fuzzy rule 


(p(x), s) SS dy.(p1(Z1), 51), ...> (pn (Zn); Sn)» s:=f (s) > 


in which the scoring function in the scoring atom depends on the scores of 
the atoms in the body only (and all s,s; are scoring variables), is represented 
succinctly as 


P(x) — fprlz1),---,Pnl2n)) - 


According to Remark 44, the rule in Equation (11.2) can be represented as 
well as 


GoodHotel(x) + min(Hotel(x),0.3 - Cheap(x) 
+ 0.5 : CloseToVenue(x) + 0.2 - Com fortable(x)) . 


Semantics. From a semantics point of view, given K = (F,P), the notions of 
Herbrand universe Hy (the set of all constants occurring in K) and Herbrand 
base Bx of K (the set of all ground atoms that can be formed using constants 
in Hx and atoms occurring in K) are as usual. 

Similarly to the crisp case, given K, the (possibly infinite) set of ground 
rules K* derived from the grounding of K is constructed as follows: 


1. set K* to be {(p(t),n) + n:=n | (p(t), n) e F,r € [0, lo}; 


2. add to K* the set of all ground instantiations of rules in K, in which we 
do also replace scoring variables with rational values in [0, 1]o. 


Remark 45. Note that in K* we replace scoring variables with rational values 
in [0, 1]g. Therefore, K* does not consists of a finite set of rules iff at least 
one scoring variable is present in P. 
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An interpretation T for K is a function I: Bx > [0, 1]g. Given K, we say that 
T satisfies (is a model of) 


e a ground fuzzy atom (A,r), where A € Br, denoted Z — (A,r), iff 
T(A) >r; 


e a ground scoring atom r:=f(v) iff r = f(v). For ease, we use f both 
denoting the logical function symbol as well as the interpreted function; 


e a ground rule body ¢ of a rule y + pe K*, denoted Z } q, iff Z is a 
model of all atoms in @; 


e a ground rule r € K*, denoted T f r, iff Z is a model of the head of r 
whenever Z is a model of the body of r; 


e K*, denoted T E K*, iff Z satisfies all rules r € K*; 
e KÇ, denoted T E K, if Z is a model of K*. 
Example 45. Consider K as follows: 


F ={ (p(a),0.2), 
(p(), 0.5) } 


r otherwise. 


=f 0 ifr<03 


Then 
Ak = {a,b} 
Be = {p(a),p(b),q(a),q(b)} 
K* = {(p(a),r) = r:=0.2 | r € [0, 1g} U 
(p(b), r} = r:=0.5 | r € [0, 1]opU 
(qla), r) = (pla), r’), r:=f (r°) | r,r" € [0, Ho} U 
(al 


0a 
i = 07 
Ti(q(a)) = 0.5 
)) = 08 
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and 


Zo(p(a)) = 04 
Zo(p(6)) = 0.7 
lala) = 05 
Za(q(6)) = 06. 


Note that Zi and Ta differ only in the interpretation of q(b). Now, it can be 
verified that T; is a model of K* and, thus, K, while Ta is not. Specifically, Ta 
is not a model of the rule 


(a(b), 0.7) — (p(0), 0.7), 0.7:=f (0.7) 
because, La is a model of the rule body, but is not a model of the rule head. 


Now, let us extend the < relation from [0, 1] to the set Ik of all interpretations 
point-wise: (i) Zi < Ta iff T¡(A) < Za(4), for every ground atom A € Bx. 
With Z, we denote the bottom interpretation (Z, maps all atoms to L), while 
with Zr we denote the top interpretation (Zr maps all atoms to 1). It turns 
out that (Ik, <) is a complete lattice. 

As for the crisp case, one model plays a special role: namely the <-least 
model Mx of K. The existence, finiteness, and uniqueness of the minimal 
model Mx are guaranteed to exist by the following argument. Consider the 
function Tk: Ik > Ik defined as follows: for all ground atoms A € Br 


Tx(Z)(A) = supir | (A,r) 0€XK*st.T Ed). (11.3) 
Then, it can be shown that Tx is monotone on Ix, i.e., if Zi < Za then 


Tk(Z1) S Tx (Ze). In fact, fn <Z A (A,r) ee p € K* and h = 0) then 
I> = g as well. Therefore, 


Tx(Zi)(A) = sup{r| (A,r) — de K* s.t. T1 = 0) 
< sup{r| (A,r) — dE K* s.t. Ta = 0) 
= Tk(Z2)(A) . 


Therefore, by the well-known Tarski-Knaster fixed-point theorem [439] for 
monotone functions over complete lattices, we get immediately the analogue 
of Proposition 16: 


Proposition 85. For a knowledge base K, there exists a unique minimal 
model Mx. that is the least fixed point of the function Tk. Mx can be obtained 
as the limit of the <-monotone sequence, To,...,L;,..., where 


Lo = Ti 


Zuni = Teh). (Ha) 
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Example 46. Consider Example 45. Let us illustrate the computation of the 
minimal model. We do only report the changes to the previous step. 


To = Li 
Ti(p(a)) = Tk(Zo)(p(a)) = 0.2 
T,(p(b)) = Tx (Zo)(p(b)) = 0.5 
Talq(b)) = Tx(Zi)(q(b)) = 0.5 


la = Te(la)=Z>. 


Therefore, Mx = T is the minimal model with 


Mx(p(a)) = 0.2 
Mx(p(b)) = 0.5 
Mx(q(a)) = 0.0 
Mx(q(b)) = 05. 


Let us remark that however, a finite number of iterations may not suffice 
to compute the minimal model as illustrated by the following example (for 
similar examples, see, e.g., [239, 456]). 


Example 47. Consider the score combination function f(x) = 4% (0 <a < 
1), and Kı containing the rules: 


(p,s) +} s:=0 
(p,s) + (ps), s:=f(s). 


Then the minimal model is attained after w steps of Tk iterations starting 
from Zu and is Mx, (p) = a. 

Now, consider the (non-continuous) function g(a) =0 ifa < a, and g(x) = 
1 otherwise, and Kz containing only the rules 


(p,s) +} s8:=0 
(ps) <— (p,s),s:=f(s) 
(as) += (p,s),s:=gls 


Then the minimal model is attained after w+1 steps of Tk iterations starting 
from Zu and is Mx,(p) = a, Mx. (q) = 1. 


However, it can be shown that if all functions appearing in P are Scott- 
continuous, then at most w steps are necessary to compute the minimal 
model [400, 402]. 
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In any case, finiteness of the truth space, monotonicity of score combination 
functions together with the finiteness of the grounded program guarantees 
then that the minimal model can be computed in finite time by iterating Tx 
over |. 


Remark 46. Note that the requirement of being the scoring functions mono- 
tone is crucial to guarantee the existence of a unique minimal model. For 
instance, the KB 


) 


involving the non-monotone scoring function 1 — x, has two minimal models, 
M! and M?: namely 


Mila) = 0.7 

MẸ (b) 0.3 
and 

Méla) = 0.2 

Mi(b) = 08. 


11.3 Concrete Domains 


The notion of concrete domains naturally extends the one for crisp Datalog 
presented in Section 5.3 and is coherent with what proposed for the fuzzy 
propositional logic case (see Section 8.2.3), where now a concrete atom, rather 
being true or false only, is mapped into [0, 1]y (see, e.g., [398]). Specifically, we 
assume a set of constants that are data values and a set of datatype predicates, 
where each datatype predicate has a predefined arity n > 1. A datatype theory 
D=(AP, -P) consists of a finite datatype domain AP and a mapping -P 
that assigns to each data value an element of AP and assigns to each n-ary 
datatype predicate p a n-ary function pP: AP x ... x AP > [0, 1]o. 

A concrete atom is an expression of the form d(z), where z is a vector of 
variables or data values. For instance, 1s(18000, 22000) is the well-known left- 
shoulder fuzzy set membership function we have seen in Section 8.1.1, which 
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maps a data value into [0, 1]g. A concrete atom d(z) may occur in a rule body 
only in the form of a fuzzy atom expression (d(z), s). 

As in Section 5.3, let us extend both the Herbrand universe as HD = 
Hx U AP and the Herbrand base BP to the set of all ground atoms and 
ground concrete atoms that can be formed from the constants in the Herbrand 
universe. Then, an interpretation T for K is a function I: BP — [0, 1]g with 
the additional condition that d? = dP, for each datatype predicate d. 

Now, the notions of satisfaction (is model of) are extended in the obvious 
way to rules including concrete fuzzy atoms. For instance, an interpretation 
T is a model of a ground concrete fuzzy atom (d(v),r) iff dP(v) > r. It is 
then straightforward to see that the analogue of Proposition 85 still applies 
to interpretations over BP in place of Bx. 


11.4 Fuzzy Conjunctive Queries 


Similarly to Section 5.4, a fuzzy query is a fuzzy rule 


(a(x), 8) — p(x, y, s), s:=f (x, y,8S) , (11.5) 


and the answer set of a query q w.r.t. a KB K (denoted ans(K, q)) is the set 
of tuples (t,r) such that Mx (q(t)) = r > 0, where K’ is obtained from K 
by extending it with the query rule (11.5) (Mx is guaranteed to exists by 
Proposition 85). 


Example 48. Given K with rule 
q(x) 0.5 - p(x) +0.7 - r(x) 


and facts 
(p(a),0.4), (p(b), 0.2), — (r(b), 0.5) 
then ans(q, K) = {(a, 0.45), (b, 0.2) }. 
In the following, to ease the presentation, we will also say that a fuzzy query is 


of the form q(x), intended as a question about the truth degree of the ground 
instances of q(x) in the minimal model of K. 


11.5 Reasoning 


We present here some methods to determine the answer set ans(K,q) of 
a query q, given a KB K. The first one is a generalization of the one SLD- 
resolution methods presented in Section 5.5.1, the second one maps fuzzy LPs 
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into crisp LPs, while the third one addresses a rather novel problem in the 
context of fuzzy LPs, namely the problem of determine the top-k answers [289, 
409, 414, 428]: 


Top-k retrieval. Given K, retrieve top-k tuples of the answer set of q ranked 
in decreasing order relative to the score, denoted 


ans (K,q) = Top, (ans(K, q)) . 


For the sake of illustrative purposes, we consider the following well-known 
example in which we define a path in a weighted graph. 


Example 49 ([428]). Consider the following rules 


ru:path(z,y) +  edge(x,y) (11.6) 
ra :path(z,y) < min(path(z,z),edge(z, y)) (11.7 
and the facts 
Teage 

c b 0.6 

a c 05 

b a 0.4 

a b 03 


It can be verified that the set of answers of predicate path is given by: 


ans(K, path) 
a a 04[|b a 04lc a 04 
a b 05ļ||b b 04l[c b 0.6 
a c 05b c 04lc c 04 


The top-3 answers for path are instead: 
ansk(K,q) = {(c, b, 0.6), (a, b, 0.5), (a, c, 0.5)} . 
We next introduce the following requirements. 


Truth space. To always guarantee termination of the procedures presented 
here, we will assume that the truth space is £ = {L,0, 1, Ss r, 1}, for some 
positive integer n. We will use the symbol L to denote the truth “undefined” 
and extend the linear order < over rational numbers by postulating L < 0. In 
the rules, we assume that the scoring function is such that f(...,L,...) =L. 
We assume that for a fact (p(t),r), r > L holds. The semantics is a straight- 
forward adaption of the one presented in Section 11.2 and Proposition 85 
extends easily to this setting as well: 
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Proposition 86 ([428]). Given a truth space £L =4(1,0,%,...22,1), then 


In? 


for a knowledge base K, there exists a unique minimal model Mx that is the 
least fixed point of the function Tk. Mx can be obtained as the limit of the 


<-monotone sequence, Lo,...,L;,...,, where 
Do = Li 
11.8 
Liy = Tk(T;). (1.8) 


in which the interpretation T, maps all ground atoms to L. Furthermore, the 
number of iterations is bounded by |L]. 


From a practical point of view the finiteness of £ is a limitation we can live 
with especially taking into account that computers have finite resources, and 
thus, only a finite set of truth degrees can be represented. For other conditions 
that guarantee termination, we refer the reader to, e.g., [103, 104, 105, 111, 
112, 239, 400, 402]. 


Remark 47. Note that we have introduced 1 to distinguish the case where 
a tuple t may be retrieved, even though the score is 0, from the case where a 
tuple is not retrieved, since it does not satisfy the query. In particular, if a 
tuple does not belong to an extensional relation, then its score is assumed to 
be undefined (L), while if T would be defined on [0,2,...22,1), then the 


rn? 


score of this tuple would be 0. The semantics is inspired by [289, 428]. 
To further illustrate the case, consider the following example. 


Example 50. Consider again Example 48. Then, w.r.t. L = {1,0, b EL >> 1} 
ans(q,K) = {(b,0.2)}. Please, note that (a,0.45) g ans(q,K) since 


Mx(r(a)) =L and 0.5-0.9 +0.7- 1 = L, by definition of scoring functions. 


Eventually, it is not difficult to see that, w.l.o.g. we may assume that all fuzzy 
rules are of the form 


(p(x), 8) < 3y. (pı (zı), s1), wey (Pn(Zn), Sn), s:=f(s, z) > (11.9) 


in which all scoring terms s, s; are scoring variables, and all variables occurring 
in the rule occur in the scoring function as well. This may easily achieved by 
observing that 


e a fuzzy atom (p;(z),1r) occurring in the rule body with r € [0, 1]g, may be 
replaced with a fuzzy atom (p;(z), s”), where s’ is a new scoring variable 
and where the scoring function f(s,z) is replaced with 


9(s,z,5') = fh its > r 


=] otherwise; 


e a fuzzy atom (p(z),r) in the head with r € [0, 1]g, may be replaced with 
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a fuzzy atom (p(z), s’), where s’ is a new scoring variable and where the 
scoring atom r:=f(s, z) is replaced with s:=g(s, z), where 


FA a f if f(s,z) =r 


L otherwise; 


e if a variable z’ occurring in the rule and not being the scoring variable 
in the head, does not occur in the scoring function, then the scoring 
function f(s, z) is replaced with 


g(s, z, 2’) = f(s, z). 
For instance, note that a rule! 
(p, 0.7) — (q, 0.5) 
is replaced with the rule 


(p, 8) = (q, 8’), s:=g(s”) (11.10) 


ene: a if s > 0.5 


where 


1 otherwise. 


In fact, in the rule 
(p,0.7) E (q, 0.5), 0.7:=0.7 


by replacing (q,0.5) the rule becomes 


(p, 0.7) = (q, 8"), 0.7:=f(s’) 


(s’) 0.7 ifs >0.5 
S == 
db | otherwise 


where 


and then by replacing (p, 0.7) we obtain the rule in Equation (11.10). 

In the following, we will assume that the answer set of a query q w.r.t. K 
is defined as the set ans(q, K) of tuples (t,r) € Hic x... x Hg x £ such that 
Mx(q(t)) =r > L (the score of t is r > L in the minimal model). 

We conclude this section by recalling the notion of recursive / non-recursive 
KB from Section 5.5.2, i.e., given K, we say that relation symbol p directly 
depends on relation symbol q if there is a rule in K such that p occurs in 
the head of it and q occurs in the body of it. The relation depends on is the 
transitive closure of “directly depends on.” The dependency graph of K is a 
directed graph where nodes are relation symbols and the set of edges is the 
“directly depends on” relation. The KB is recursive if there is a cycle in the 
dependency graph (i.e., there is p depending on p). 


1Recall that the scoring atom 0.7:=0.7 is omitted. 
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11.5.1 SLD-Resolution Driven Query Answering 


The method proposed here naturally generalizes the one presented in Sec- 
tion 5.5.1 to the fuzzy case and is inspired to [104, 239, 456]. We first present 
the propositional case and extend it to the FOL case. 

The propositional fuzzy SLD inference rule is as follows: assume we have 
propositional rules in which we assume that all scoring variables in the second 
rule have been renamed in order not to share any variable with the first one. 

From (4,8) + (41,81),-..,(Aj,Sj),---, (Ax, sg), s:=f (8) 


and (B,s') = (B1,81),--- (Bm, Sin), 8’ =9(s’) 
and B = Aj 


J 
infer (A, s} — (A1,81),-.., (Aj—1, 83-1), (11.11) 
(Bı, si), e.’ (Bm, Sm) 
(Aj4i, Sj+1) >> (Ap) Sk); 
s:i=f(s1,...,8j-1,9(8/), Sj41)--->8k) - 


The propositional atom B is called the selected atom. Essentially, we replace 
the fuzzy atom (A,,s;) with the fuzzy atoms (B1, s81), --., (Bm, Sm) and ac- 
cordingly replace the scoring variable s; occurring in the scoring function f 
with g(s’). 
When the second rule is replaced with a fact, the specialized fuzzy SLD 
inference rule is: 
From (A,s) — (Ai, 81),..-, (Aj, Sj) ---, (Ak, sk), s:=f (8) 
and (B,r) 
and B = Aj 
infer (A, 8) — (A1,81),-.., (Aj-1,Sj-1), 
(Aj415 8541) +++ > (Aks Sk)» 
s:i=f(S1,...,Sj-1)1,Sj41)-->>8p) + 


(11.12) 


[Ya 


In the above inference rules, we assume that the scoring function is evaluated 
as soon as all required parameters are ground. 

A fuzzy SLD-derivation of q w.r.t. K consists of a finite sequence of rules 
T1,...,Tn, each of which has q as head, each rule r;,1 is inferred from r; via 
fuzzy SLD-resolution, and r,, is the rule 


(q, 8) = s:=r 


telling us that indeed the truth of q in the minimal model is greater or equal 
than r € [0, 1]o. r is called a computed answer of q w.r.t. K. 


Example 51. Consider the following KB K with rules 


(as) + (D1,81), (p2, $2), 5:=81 ` 82 (11.13) 
(q,s) +=  (p3,s3),s':=max(0.5, s3) (11.14) 
(pas) <  (p1,sa), (pa, 55), 85” :=0.3 - s4 +0.7- 85 (11.15) 
and facts 
(p1,0.7) (11.16) 


(p2, 0.6) (11.17) 
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Let us determine the truth of q in the minimal model of K. The following are 
two fuzzy SLD-derivations of q w.r.t. K: 


(q, s} — (p2, s2), s:=0.7 -s2 via (11.13), (11.16) (11.18) 
(q, s} E s:=0.42 via (11.18), (11.17) (11.19) 


(q, s') — (pi, sa), (pa, 85), s' = max(0.5, 0.3 - s4 + 0.7 - s5) via (11.14), (11.15) (11.20) 
(q, s') = (p2, s5), 8’ := max(0.5,0.3 -0.7 + 0.7 - s5) via (11.20), (11.16) (11.21) 
(q,s') = s1:=0.63 via (11.21), (11.17) (11.22) 


It is easily verified that there is no other fuzzy SLD-derivation of q ending 
with a higher score than 0.63. 


It can be shown that 


Proposition 87 (Soundness & Completeness [104, 239, 456]). Given a KB 
K and a query q, then 


1. ifr € [0, llo is a computed answer of q w.r.t. K then Mx(q) > r; 


2. there is a (finite) fuzzy SLD-derivation such that Mx(q) is a computed 
answer of q w.r.t. K. 


We next address the FOL case, which is essentially the same as the proposi- 
tional case described above, except that now we have to take unification into 
account. 
The inference rules are the following (as usual, we assume a variable re- 

naming of the second input in the inference rules): 

From (A, s) s= (Az, s1), ...) (Aj, sj), ...) (Az, Sk); s:=f(2, s) 

and (B,s') — (B1, 81), (Bm, Sm); S =9(z2', 8’) 

and 0 as a mgu of {B, Aj) 


infer (AO, s} — (410,51),..., (45-10, 55-1), (11.23) 
(B10, s1), ie Bmð, sm 
(Aj+10, 8541) - -< (Ak, sp), 
s:=f(z,81,...,sj-1,9(2,8'), 854 1),---,Sk)0 . 


From (4,5) + (41,81),...,(Aj,S5),-.., (Ap, Sk), s:=f (2, 8) 
and (B,r) 
and 0 as a mgu of {B, Aj) 
infer (AD, s) S (A10,51),...,(Aj-10,55-1), 
(Aj+10, Sj+1) a (Az 0, Sk), 
s:=f(2,S1,...,Sj-1,7,8j41)-»»,SpJO . 


(11.24) 


Now, a fuzzy SLD-derivation of a query q w.r.t. K consists of a finite sequence 
of rules rj,...,Tn, each of which has q as head, each rule r;,1 is inferred from 
ri via fuzzy SLD-resolution, and r,, is the rule 


(q(x)0, s) — s:=r 
telling us that indeed the truth of 


q(x) 
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in the minimal model is greater than or equal to r € [0,1]g. With 0), we 

denote the vector (110,...,t,0), i.e., the restriction of the substitution 0 to 

the variables in x. In that case, we say (0;,,r) is a computed answer for q. 
As for the propositional case (see Proposition 87), it can be shown that 


Proposition 88 (Soundness & Completeness [104, 239, 456]). Given a KB 
K and a query q(x), then 


1. if (t,r) is a computed answer, then Mx(q(t)) > r, i.e., there is an 
answer (t,r”) € ans(K, q) such that r’ > r; 


if (t,r) € ans(K, q), then there is a (finite) fuzzy SLD-derivation such 
a en) is a commuted answer of q(x) w.r.t. K. 


Therefore, in order to compute the answer set ans(K,q) via fuzzy SLD- 
derivation, it suffices to determine all computed answers of q w.r.t. K. 


Example 52. Let us consider a FOL variant of Example 51. Consider the 
following KB K with rules 


(a(z),s) <=  (pi(z2),s1), (pa(2), 82), s:=81 > 82 (11.25) 
(q(z),s) +  (ps(x), 83), $:=max(0.5, s3) (11.26) 
(p3(a’), 8’) (p(x), sa), (p2(2’), 55), s':=0.3 + s4 +0.7-85 (11.27) 
and facts 
(pı (a), 0.7) (11.28) 
(p2(a), 0.6) (11.29) 


Let us determine the answer set of q. The following are two fuzzy SLD- 
derivation of q(x) w.r.t. K: 


(q(a),s) + (pa(a), s2), s:=0.7 -s2 (11.25), (11.28), 
01 = {x/a} (11.30) 


(q(a), s} + s:=0.42 (11.30), (11.29) (11.31) 


(q(x), s) + (pı(z), sa), (p2 (£), 85), s:= max(0.5, 0.3 - s4 + 0.7 - s5) (11.26), (11.27), 
02 = {x'/x} (11.32) 


(q(a), s) — (p2(a), ss), s:= max(0.5, 0.3 -0.7 + 0.7: s5) (11.32), (11.28), 
02 = (a /x, x/a}(11.33) 


(q(a),s) + s:=0.63 (11.33), (11.29) (11.34) 


Hence, we have two computed answers (a, 0.42) and (a, 0.63). It easily verified 
that no other computed answer (a, r) with r > 0.63 exists and, thus, (a, 0.63) € 
ans(K, q) (specifically, we have ans(K, q) = {(a, 0.63) }). 
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Example 53. Consider Example 49 and the query 


(q(x, y), s) = (path(z, y), 81), S:=8] . 
The following is a SLD-derivation of q w.r.t. K (see also Example 19): 


(q(z,y),5) + (path(a,y), $1), s:=81 

(q(x, y),8) + (path(a, z), $1), (edge(z, y), $2), $:=min(s1, sa) 

(q(a,y),8) + (path(a, 21), 81), (edge(z1, 2), 82), (edge(z, y), 83), 5:=min(s1, s2, 53) 
(q(x,y),5) += (edge(x, 21), sa), (edge(z1,2), 52), (edge(z, y), sa), s:= min(s1, sa, 83) 
lale,y),s) + (edge(d, z), s2), (edge(z, y), 53), s:= min(0.6, s2, 53) 

lale,y),s) +=  (edge(a, y), sa), s:= min(0.6, 0.4, 53) 

(q(c,c),s) +  s:=min(0.6,0.4,0.5) . 


((c,c),0.4) is an answer computed by this SLD-derivation. 


11.5.2 Reduction to Classical Logic 


Another and simple way to reason with fuzzy rules consists in mapping 
them into crisp LP rules, if arithmetic built-in atoms are supported (which 
is often the case). Essentially, we map an n-ary fuzzy atom (p(z),r) into an 
n + l-ary atom plz, 1), i.e., 


(p(z), r) > plz,r) (11.35) 


and use the additional slot as a place holder of the degree of truth of the fuzzy 
atom. So a fact is mapped as 


(p(t),r) > plt,r), 


while the rules having the same predicate in the head, 
(p(x),s) + yi(,y1,81) 
(p(x),s) — pr(X, Yk; Se) 
can be transformed as 
pi (xs) — p1(x,y1,S1) 


P(x, s) hres P(x, Yk, Sp.) 


p(x,s) = pi(x,s1),...,pp(X, Sk), $= max(s1,..., 8K) , 
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where p; are new predicate symbols and p'(x,y;,s;) is obtained from 
pi(x, yi, Si) by using the replacement schema of Equation (11.35), which con- 
cludes the transformation. 


11.5.3 Top-k Query Answering 


While there are many works addressing the top-k problem for vague queries 
in databases (cf. [78, 94, 150, 152, 212, 213, 258, 257, 306]), little is known 
for the corresponding problem in knowledge representation and reasoning [97, 
98, 110, 111, 289, 224, 358, 360, 404, 409, 410, 414, 421, 457]. 

We note that retrieving the top-k answers of an extensional relation p is 
trivial as we have just to retrieve the first k tuples in the relational table 
T, associated to p. Hence, we restrict top-k retrieval to intensional relations 
only. Note also that (i) ans,(K,q) is not necessarily unique as there may be 
several tuples having the same score of the bottom ranked one in ansg(K, q) 
(we assume in this case that ties are broken arbitrary); and (ii) there may be 
less than k tuples in ans;,(K,q) as there might not be k non- scored tuples 
in ans(K, q). 

Furthermore, please note that having introduced the degree 1 € £ allows 
us to be compatible with top-k SQL retrieval over relational databases (see, 
e.g., [258]), where it is understood and common practice not to retrieve tuples 
that do not satisfy the condition of the plain SQL query (that is, the SQL 
query where the function computing the score is left out), and, in particular, 
tuples not occurring in any relational table (see e.g., Example 50). 

In the following, we address first the top-k retrieval problem for non- 
recursive KBs [409] and then address the general case [423]. 


11.5.3.1 Top-k Retrieval for Non-Recursive KBs 


Consider a non-recursive KB X. Interestingly, for such case there is a 
simple procedure to determine the top-k answers of a query [409], which closely 
resembles the one for DL-Lite (see Section 10.6.3): 


1. by considering the rule component, the user query q is reformulated into 
a set of conjunctive queries r(q, K) using the fuzzy SLD-resolution rule; 


2. the reformulated queries in r(q,K) are then evaluated over the facts 
component, which is stored in a database, producing the requested top- 
k answer set ans; (XK, q). 


Specifically, given the fuzzy SLD-resolution rule (11.23), let r,r',r” be the 
first, second, and inferred rule, respectively. We say that rule 7” is applicable 
to atom A, occurring in rule r and we call r” the resolvent of r and A;, 
denoted resolve(r”, Aj). 

The query reformulation algorithm (Algorithm 6) is is quite similar to the 
DL-Lite case (see Section 10.6.3). Given a query q and a KB K, the algorithm 
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reformulates q in terms of a set of conjunctive queries r(q, K), which then can 
be evaluated over the facts component in K. 


Algorithm 6 QueryRef (q, K) 


Input: query q, KB K. 
Output: Set of reformulated conjunctive queries r(q, K). 


1: r(q,K) := {a(x) — ¢ | a(x) = pe K} 


3 
4 for all queries q € S do 

5: for all atoms A; € q do 

6: if rule r’ € K is applicable to A; then 
es r(q,K) := r(q,K) U {resolve(r’, A;)} 
8: until S = r(q,K) 

9: return r(q,K) 


The termination of the query reformulation step is guaranteed by the non- 
recursiveness of K. This concludes the query reformulation step. 


Example 54 ([409]). Suppose the rule component P contains the rules rı and 
Tra: 

ri: Tle,y) + Ple,y) 

ro:  R(z,y) = Blx),Cly). 


We also assume that the fact component F is 


P B| [C 
OȚI 1 I 
312| 12] T3] 
415 5 2 
618 T 4 


Assume also that we have the rules 


qd: laa),s) = T(z,y), R(y,z),s:=max(0,1 — x/10) 
qa”: (q(x),s) < C(x),s:=max(0,1—ux/5). 


Now, it can be verified that, e.g., 


K E (a(3),0.7) 
K = (q(1),0.8), and 
K We (q(6),v) for any v € [0,1] , 


The answer set is 
ans(K,q) = {(0,1.0), (1, 0.8), (2, 0.6), (3, 0.7), (4, 0.6)} 
and, thus, that the top-3 answers are 


ans3(K,q) = {(0, 1.0), (1, 0.8), (3,0.7)} . 
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It can also be verified that 


r(q, KC) = (d,q”,q1,q2) > 
where 
qı: (g(x),s) + P(x,y), Rly, z), s:=max(0,1— x/10) 
q2: lala)s) + P(x,y), Bly), C(z), s:=max(0,1— 2/10) . 


It is easily verified that now we have only to evaluate q” and q2 over the facts 
component of K, as both q! and qı queried over the fact components provide 
empty results only. Specifically, we have that 

ans(F, q") {(1, 0.8), (3, 0.4), (2, 0.6), (4, 0.2)} 

ans(F, q2) = {(0, 1.0); (3, 0.7), (4, 0.6)} E 


Hence, ans(K, q) is obtained from ans(F, q") Uans(F, q2) from which we drop 
a tuple (t,r) if there is another tuple (t,r') in it with r! >r. 


The main property of the query reformulation algorithm is as follows. It can 
be shown that 


Proposition 89 ((409]). Given a KB K =(F,P) and a query q, then 
ansk(K, q) = Toppi (t, v) | q, E r(q,P), (u(t), v) e ans(F, q;)) - 


The above property dictates that in order to determine the top-k answers, we 
may reformulate the query q using the rule component P only, and then query 
the reformulated queries q; € r(q, P) against the fact component F stored in 
a relational database. From the union of these answer sets we can find the 
top-k answers. 

Of course, the size of the rewriting r(q, P) may be exponential w.r.t. to the 
size of q and P in the worst case as the following simple example illustrates. 


Example 55. Consider the query 
q(x)  Ao(z) 
and a set of rules P of the form (i =0,1,...,k — 1) 


Aj(x) Aaa), Ci41 (2) 
Alx) — Aj41(x), Dia (x) . 


Then {q} UP has 2k + 1 rules, whose size (number of atoms) is bounded by 
3. So, |a| + |P| is O(k). Now, it is easy to see that r(q,P) consists instead of 
2*+1 rules. The size of r(q,P) is 


k k k 
Y 24-642) = 27 214) i2 = 2.24414 (R121 49 = (41)-24+142, 
1=0 1=0 1=0 


which is O(k - 24) and, thus, is exponential w.r.t. |q| + |P]. 
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We next show how to find the top-k answers of the union of the answer 
sets of queries q; € r(q, P). 

A naive solution to the top-k retrieval problem is as for the case of DL-Lite: 
we compute for all q, € r(q,P) the whole answer set ans(F, q;), then we 
compute the union of these answer sets, order it in descending order of the 
scores and then we take the top-k tuples. 

A more effective solution consists in relying on existing top-k query an- 
swering algorithms for relational databases (see, e.g., [94, 150, 258]). Like for 
DL-Lite, we may compute for all q, € r(q, P), the top-k answers ansk(F, qi), 
using e.g., the system RankSQL [258]; and if both k and the number, 
ng = Ir(q,P)|, of reformulated queries is reasonable, then we may compute 
the union, of these top-k answer sets, order it in descending order w.r.t. score 
and then we take the top-k tuples. 

We can further improve this solution by adopting the Disjunctive Threshold 
Algorithm (DTA) we have seen also for DL-Lite. 

While computing r(q,P) may require exponential time w.r.t. the size of 
q and P (see Example 55), computing the top-k answers of each query q; € 
r(q,P) requires (sub) linear time w.r.t. the database size (using, e.g., [94]), 
and, thus, so does the DTA. 


11.5.3.2 Top-k Retrieval: The General Case 


We address here now the more general case in which P may be recursive 


(see [428]). 
Consider K = (F,P), consisting of a facts component F and an LP com- 
ponent P. F is a finite set of facts of the form (p(c1,...,Cn), s). We assume 


that all rules in P are of the form 


P(X) = f(pr(Z1), +++, Pm(Zm)) - 


For our purposes, in the following, let Hx and Bx be the Herbrand universe 
and Herbrand base of K, respectively. An interpretation Z for K is a function 
I: Be > L, where £ = {1,0,2 1 1} and 


in? 


1. Z(r) =r, for r € £; and 
2. T(f(4, tee , Am)) = F(T(A1), sp4 ,T(Am)))- 


With Zu we denote the bottom interpretation (Zu maps all atoms to L), while 
with Zy we denote the top interpretation (Zr maps all atoms to 1). 

We recap that Z is a model of K iff for all ground rules A + pe K*, 
L(y) < I(A) holds. Differently to Section 11.2, the existence, finiteness, and 
uniqueness of the minimal model Mx is guaranteed to exist by the following 
non usual argument. 

Consider K, the Herbrand base Be = ([41,..., An) and K*. Note that 
both are finite and so is K*. Let us associate each atom A; € Br with a 
variable z;, which will take a value in £ (sometimes we will refer to that 
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variable with z4 as well). An interpretation Z may be seen as an assignment 
of truth values to the variables x1, ..., 1,, and vice versa. Now, for each ground 
fact (A,r) in K*, we consider the equation 


LA ZT 
while for each ground rule A + f(41,..., Am) in K* we consider the equation 
ta = f(@a,,---, 0A, ) 


if A is head of at most one rule, while we consider the equation 
xa =max{f(...)| Ac f...) E K*} 


if A is head of more than one rule. 
Now, given K*, we have obtained the system of equations 


Tı = Ai la 
(11.36) 


Tn = Tn a > 


where the variables inside the functions f; belong to {x,..., En}. 

Each variable x;, will take a value in £, each (monotone) function fi 
determines the value of x; (i.e., Ai). We refer to the monotone system as in 
Equation (11.36) as the tuple S = (£,V,f), where V = {21,...,2,} are the 
variables and f = (f,,..., fn) is the tuple of functions. 

Now, it can be verified that the minimal model of K*, i.e., Mx, is bijectively 
related to the least solution of the system (11.36). As it is well known (see, 
e.g., [9)), a monotonic equation system as (11.36) (each function f;: L” — £ 
in Equation (11.36) is <-monotone) has a <-least solution, lfp(f), the <-least 
fixed-point of f is given as the least upper bound of the <-monotone sequence, 
Yo0)---,Yi,---, Where 

Ye = 1L 

Sid) 23 E (11.37) 
It is thus immediate that, as the truth space is finite, the above sequence 
converges in a finite number of steps and that the <-least model Mk of K is 
finite. 


Computing top-k answers. We describe an incremental query driven top-k 
query answering algorithm. 

A distinguishing feature of our query answering procedure is that we do 
not determine all answers, but collect, during the computation, answers in- 
crementally together and we can stop as soon as we have gathered k answers 
greater or equal than a computed threshold 6. 

For the ease of reading, we will proceed stepwise. In the next section, we 
will provide an algorithm answering ground queries. Additionally, we will see 
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later on that the same algorithm can be used to compute the threshold we 
are going to use later on. Eventually, we will extend the latter procedure to 
compute the top-k answers only. 


A query driven procedure for equational systems. It is illustrative to 
address the following specific problem. Consider an equational system S = 
(£, V,f) and a variable z;. How can we compute the value of variable x; in 
the least fixed point of S? The immediate way is to compute bottom-up the 
least fixed point as described in Equation (11.37), and then look for the value 
of x; in the least fixed point. But, there is also a query driven method [9]. The 
method has been used then in [400] as a basis for a query driven ground query 
answering method for normal logic programs and has further been extended 
in [289, 413, 414]. This is not surprising, as we have seen in the previous 
section that the minimal model of X*, i.e., Mx, is bijectively related to the 
least solution of a system of the form (11.36). Hence, if we want to know the 
truth value of a ground atom A in Mk, it suffices to look at the value of the 
variable x4 in the least fixed point of the related equational system. 

Formally, consider an equational system S = (£, V, f) of the form (11.36). 
The procedure described in Algorithm 7 determines the value of a set of 
variables in Q in the least-fixed point, and is a slight refinement of the one 
presented in [9]. 


Algorithm 7 Solve(S, Q) 


Input: <-monotonic system S = (£, V, f), where Q C V is the set of query variables 
Output: A mapping v that equals lfp(f) on Q 
1: A: Q, dg: Q, in: 0 
2: for all x € V do 
v(x) = L, exp(x) = false 
while A #9 do 
select z; € A, A: =A\ {ai}, dg: =dgUs(a;) 
ri = fi Galti J ..., v(Zia, )) 
if r > v(a;) then 
v(x;): =r, A: =AU (p(x;) N dg) 
9: if not exp(p;) then 
10: exp(z;) = true, A: = AU (s(x;) \ in), în: = in U s(z;) 
11: return v 


00; Te OS 


We next describe how it works. We use some auxiliary functions. s(x) 
denotes the set of sons of x, i.e., 


s(z;) E {Zi es ay 


(the set of variables appearing in the right-hand side of the definition of x; in 
(11.36)). p(x) denotes the set of parents of x, i.e., the set 


p(x) = {xi | x € s(z)) 


(the set of variables depending directly on the value of x). We assume that 
each function f;: L“ + £ in Equation (11.36) is <-monotone. We also use fa 
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in place of f;, for x = z;. Informally the algorithm works as follows. Assume 
we are interested in the value of xy in the least fixed point of the system. 
We associate to each variable x; a marking v(z;) denoting the current value 
of x; (the mapping v contains the current value associated to the variables). 
Initially, v(a;) is L. 

We start with putting zp in the active set of variables A, for which we 
evaluate whether the current value of the variable is identical to whatever its 
right-hand side evaluates to. When evaluating a right-hand side it might of 
course turn out that we do indeed need a better value of some sons (which 
are initialized with the value L) and put them on the set of active nodes to 
be examined. If it turns out that a variable changes its value (actually, it can 
only <-increase) all variables that might depend on this variable are put in 
the active set to be re-examined. At some point (even if cyclic definitions are 
present) the active set will become empty, because of the finiteness of the 
truth space, and we have actually found part of the fixed point, sufficient to 
determine the value of the query Zo. 

The variable dg collects the variables that may influence the value of the 
query variables, the array variable exp traces the equations that have been 
“expanded” (the body variables are put into the active set), while the variable 
in keeps track of the variables that have been put into the active set so far 
due to an expansion (to avoid putting the same variable multiple times in the 
active set due to function body expansion). 

In [9], it is shown that the above algorithm behaves correctly. 


Proposition 90 ([9]). Given a monotone system of equations S = (L,V,f), 
then after a finite number of steps, Solve(S,Q) determines a mapping v that 
equals to Ifp(£) on Q, i.e., vig = Ifp(£)¡0- 


See [428] for the computational complexity of the Solve procedure. 


A query driven top-k procedure. We are ready now to describe the query 
driven top-k algorithm [428]. 

The procedure TopAnswers is detailed in Algorithm 8.? Basically, we will 
compute answers iteratively one by one together with a threshold 6 such that 
if we have already collected k answers with a score greater than or equal to 
ô we can stop, as any not yet computed answer will have a score no greater 
than 6. 

Roughly, we proceed as follows. Suppose q is the query. We start with 
putting the predicate symbol q in the active set of predicate symbols A. At 
each iteration step we select a new predicate p from A, look for the next highest 
scoring tuple for p (procedure getNextTuple), update the current answer set 
for p (rankedList(p)), add all predicates p”, whose rule body contains p (the 
parents of p), to A, i.e., all predicate symbols that might depend on p are put 
in the active set to be examined, and finally we update the threshold. If we 


2The condition rL’ = rankedList means that the contents do not change. 
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have already gathered k answers for q whose score is greater than or equal to 
the threshold we stop. Our procedure uses also some auxiliary functions and 
data structures: 


e for predicate symbol p;, s(p;) is the set of predicate symbols occurring 
in the rule body of a rule having p; in its head”, i.e., the sons of p;, from 
which we exclude the extensional predicates. p(p;) denotes the parents 


of pi, i.e., p(pi) = {pj | pi € s(p;)); 


e the variable rankedList contains, for each intensional relation p, the 
current ranked tuples together with their score; 


e the variable Q is a global variable. It is used in the get NeztT'uple proce- 
dure and contains, for each intensional relation p, the next top-ranked 
tuple to be returned. The tuples in Q(p) are ranked in decreasing order 
with respect to the score s. 


Algorithm 8 TopAnswers(£,K, q, k) 


Input: Truth space £, KB K = (F,P), query relation q, k > 1. 
Output: Mapping rankedList such that rankedList(q) contains top-k answers of q. 
:ó=1 
: for all predicates p in P do 
if p intensional then 
rankedList(p) = Ø, Q(p) :=0 
if p extensional then 
rankedList(p) = Tp 
repeat 
if A=( then 
A := {q}, de := {q}, in := 0, rL’ := rankedList, initialise all pointers in the 
relations tables to point to the first record 
10: for all intensional predicates p do 
11: exp(p) = false 
12: select p € A, A := A \ {p}, dg := dg U s(p) 
13: (t, s) := getNextTuple(p) 
14: if (t,s) 4 null then 


Or ET EA E ea 


15: rankedList(p) := rankedList(p) U((t,s)), A := AU (p(p) N dg) 
16: if not exp(p) then 
17: exp(p) = true, A := AU (s(p) Lin), in := in U s(p) 


18: Update threshold ô 

19: until (rankedList(q) does contain k top-ranked tuples with score greater or equal than 
query rule threshold) or ((rL’ = rankedList) and A = 0) 

20: return top-k ranked tuples in rankedList(q) 


Note that the termination of the algorithm is guaranteed by the finiteness of 
the knowledge base, the finiteness of the truth set, and the monotonicity of 
the score combination functions: each tuple may enter a ranked list at most 
h = |£| — 1 times and we stop as soon as two successive loops in TopAnswers 
do not change the ranked lists and the queue A becomes empty.* 


3Recall that there may be more than one rule having p; in its head. 
4(rL' = rankedList) and A = Í. 
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Algorithm 9 getNextTuple(p) 


Input: Intensional relation symbol p. 
Output: Next instance of p together with the score. 
1: Consider set of rules R = {r | r : p(x) — f(A1,..., An) € P} 


2: Let p; be the relation symbols occurring in A; 

3: if Q(p) 40 then 

4: (t,8) = getTon(a(p)) 

5: remove (t,s) from Q(p) 

6: return (t,s) 

7: repeat 

8: forallreR do 

9: Generate the set T' of all new valid join tuples t for rule r, using tuples in 
rankedList(p;) and square ripple join (see [212]) 

10: for all t € T do 

11: s := compute the score of p(t) using f 

12: if neither (t, s’) € rankedList(p) nor (t, s’) € Q(p) with s < s’ then 

13: insert (t, s) into Q(p) 


14: until Q(p) 4 Ø or no new valid join tuple can be generated 
15: if Q(p) #0 then 

16: — (t, s) := getTop(Q(p)) 

17: remove (t, s) from Q(p) 

18: return (t,s) 

19: else 

20: return null 


We now describe the getNextTuple procedure (see Algorithm 9). Its main 
purpose is, given a relation symbol p and the rules r; : p(x) < y; having 
p(x) as head, to get back the next tuple (and its score) satisfying the body 
conditions of some of these rules using the so far retrieved tuples for the 
relations occurring in (;. 

Whenever we already have some join combinations for p in the queue 
Q(p) (obtained by a previous call) then we just return the top-ranked one. 
Otherwise, we take into account all rules r having p in its head. For each 
rule r, we try to generate join combinations for p, involving all seen tuples 
of the relations occurring in the rule body of r. For each join combination 
we compute its score. We put the results on the queue Q(p) and return the 
top-ranked one. As Q(p) may still contain answers for p, the next time we ask 
for a next tuple with respect to p, we access Q(p) directly. 


Example 56 ([428]). Consider the rule 


r:plx,z) + pi(x,y) - poly, 2) . 


Assume that actually rankedList(p,) and rankedList(p2) contain the fol- 
lowing tuples. 


recld | rankedList(p1) | rankedList(p2) 
1 ab 1.0 m h 0.95 
2 e J} 09 |m j 0385 
3 Im 08 (7 k 05 
4 c d 0.7 m n 0.65 
5 o p 0.6 p q 0.55 
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It can be verified that a call to getNextTuple(p), will put two join combina- 
tions (1,h,0.76) and (Î,j, 0.68) into Q(p). Therefore, the procedure will return 
(1, h,0.76) as the result and remove (l, h, 0.76) from Q(p). 

The second call of getNextTuple(p), we will return (1,3, 0.68) as Q(p) is 
still non-empty, and remove this tuple from Q(p) (which makes it empty). 
The third call of getNextTuple(p), it will return (e, f,0.675). The fourth 
call of getNextTuple(p), it we return (l,n,0.52). Finally, in the fifth call of 
getNextTuple(p), it will return (o, q, 0.33). 


Threshold for querying an extensional knowledge base. Assume that 
we have a knowledge base in which the rule component consists of one rule 
only (the query rule) of the form 


r : q(x) fF (Dis P2,- --; Da) 5 


where all p; are extensional predicates. Example 56 is such a case. In this case, 
the threshold 6 is determined as in [212], which we illustrate next. 

Let t? be the last tuple seen in rankedList(p;) so far with respect to 
rule r, while let t; be the top ranked one in rankedList(p;). With t.score 
we indicate the score of tuple t.5 Then we define ô” as the maximum of the 
following n values: 


ôi =  f(ti.score,ta.score,...,tn.score) 
65 = f(ti.score,t3.score,...,tn.score) 
0, =  f(ti.score,ta.score,...,t;,.score) . 


Finally, we define the threshold to be used in the TopAnswer procedure as 
6 = 67. For instance, for 


q(x) — pı (x,y) - pa(y, 2) 
we have 


ôi = t}.score: tz.score 
65 =  ti.score-t5.score 
6” = max(67,65) . 


Example 57 (Example 56 cont.). After the first call of getNextTuple(p) we 
have that 


ti ti ta t} 
(a, b, 1.0) | (l, m, 0.8) | (m, h, 0.95) | (m, 7, 0.85) 


5If no tuple has been yet seen in p;, then t.score = 1 is assumed. 
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and, thus, 


ô = 0.8-0.95 = 0.76 
65 = 1.0-0.85 = 0.85 
6” = max(0.76,0.85) = 0.85 . 


The important fact is now that whenever we consider a new join combination 
for rule r, its score will be less than or equal to 6". Indeed, if we consider a 
new join tuple using the next unseen tuple from rankedList(p,) and a seen 
tuple in rankedList(p2), its score will be less than or equal to 67, while if we 
consider a new join tuple using the next unseen tuple from rankedList(pa) 
and a seen tuple in rankedList(p1), its score will be less than or equal to 03. 
Therefore, overall the score will be less than or equal to 6”. 


Example 58 (Example 57 cont.). In the second call to getNextTuple(p), we 
get (l, j,0.68) and 0.68 < 0.85 = 6", in the third call to getNextTuple(p), we 
get (e, f, 0.675) and 0.675 < 0.85 = 6", in the fourth call to getNextTuple(p), 
we get (l, n,0.52) and 0.52 < 0.76 = 9” (note that after the third call, 9” = 
0.76), while the fifth call to getNextTuple(p), we get (o, q,0.33) and 0.33 < 
0.665 = 9" (after the fourth call, 6" = 0.665). 


As a consequence, whenever we have top-k answers for q with a score 
greater than or equal to 6", we can stop the retrieval process (see step 9 
of TopAnswers). 

This property can be generalized to n-ary joins (see [212], Theorem 4.2.1). 
The following result holds: 


Proposition 91 ([428]). For a knowledge base in which the rule compo- 
nent consists of one rule r only (the query rule) of the form r : q(x) + 
f(p1,p2,...,Pn), where all p; are extensional predicates, then the threshold- 
based method correctly reports the top-k results ordered by the score. 


We next show that we may easily extend our threshold-based method to the 
case that for a query we have a knowledge base in which the rule component 
consists of m > 1 rules (the query rules) of the form 


ri : q(x) E fi (Pi, P3,- P, ) 


Tm : de) = fil Pret Pan) 


where all pi are extensional predicates. In this case, let 6”: be the threshold 
for rule r;, as computed previously. Now, let 4% be 


51 = max(d",...,6"™) . (11.38) 


Finally, we define 6 = 6%. 
It is easily verified that Proposition 91 holds for this more general case as 
well. 
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Proposition 92 ([428]). For a knowledge base in which the rule compo- 
nent consists of rules r+,...,rm having q(x) as their head (the query rules), 
where the arguments of the rules are extensional predicates and 6 = 61 = 
max(671,...,67), then the threshold-based method correctly reports the top-k 
results ordered by the score. 


Threshold for querying an intensional knowledge base. We address 
now the general case [428]. For this purpose consider the following example. 

The solution provided by [428] is that we need to take into account a 
threshold for each intensional predicate related to the query as they may 
depend on each other. 

Specifically, let us consider a knowledge base in which an intensional pred- 
icate is head of one rule only (we address the case p is head of more than one 
rule later on, similarly as we did previously). So, for an intensional predicate 
p heading exactly one rule r 


r: p(x) — fípi(z1),... ,Pn[Zn)) 


we consider a threshold variable 9”. With r.t,, (?.t,,) we denote the last 
tuple seen (the top ranked one) in rankedList(p;) with respect to rule r. We 
assume that by default 7.t,,.score = 1 if no tuple is rankedList(p;). For an 
intensional predicate p;, we define 


pi = max(d",7.t,,.score) 


R= 0%, 


while if p; is an extensional predicate, we define 


Pi =  T.+tp,.score 


Di =  T.tp,.score . 
Now, for each rule r we consider the equation 6(r) 


6” = max(f (pr, pa, TER Pa), F (pi pz, E Pa), S .f(p',p',- ee pa) e (11.39) 


Eventually, for a knowledge base K = (F, P}, we consider the set A of all 
equations involving intensional predicates. Note that, if K has m intensional 
predicates, A consists of m equations and m variables. As all equations in- 
volve monotone functions only, the system of equations is monotone, A has a 
minimal solution, denoted A. Finally, for a query q(x), the threshold 5 of the 
TopAnswers algorithm is defined as 


= 0t, 


where 64 is the solution to variable 9% in the minimal solution A of the set of 
equations A. 
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Please note that A and, thus, 6, can be computed iteratively as described 
in Equation 11.37. Of course, if specific functions are involved only (e.g., linear 
functions), then better methods may be available to compute the minimal 
solution. We may also apply the Solve(A, {57}) algorithm to compute 67 in a 
more effective and query driven manner. 

In case an intensional relation p is in the head of more than one rule, e.g., 


ru: p(x) = (xy) 
eer emer ee, (11.40) 


we proceed as previously in case of empty logic program in which now 0? is 
the maximum among the thresholds computed for rule rı and ra according to 
Equation (11.39). That is, for two new relations p’, p”, we define 


5? = max(dP 97) (11.41) 


where 5?’ , 6?” are the thresholds computed according to Equation (11.39) for 
the two rules 
p(x) += il(x,y) 


Pink a (11.42) 


Example 59 (Example 49 cont.). Let us determine top-3 answers for path. 
It can be verified that the equations for the threshold computation are: 

¿path = max(dP2th’ ; grata”) 

gpath’ 71 .teage-score 


path’ — max(min(6P3k, #5.teage.score), min(max(d?**", fo .tpatn.score), ra.teage-score)) 


The computation is shown below (we use the abbreviations p,p',p”,e for 
path, path’, path” and edge, respectively). 


TopAnswers 
scores 
Iter Alp (t, s) rankedList(p) ôP oP oP ri-te Pa tp ro.te fo.te 
Loop 1 1.0 | 1.0 [ 1.0 1.0 1.0 1.0 1.0 
1. p | p’ | (c, b, 0.6) (e, b, 0.6) 1.0 | 0.6 | 1.0 0.6 1.0 1.0 1.0 
2; p |p (a, c, 0.5) (e, b, 0.6), (a, c, 0.5) 0.5 | 0.5 | 0.5 0.5 0.6 0.4 0.6 
35 p |p (c, a, 0.4) (e, b, 0.6), (a, c, 0.5) 0.5 | 0.5 | 0.5 0.5 0.6 0.4 0.6 
(e, a, 0.4) 
4. p |p (a, b, 0.5) (e, b, 0.6), (a, c, 0.5), 0.4 | 0.4 | 0.4 0.4 0.6 0.4 0.6 
(a,b, 0.5), (c, a, 0.4) 
Stop, return(c, b, 0.6), (a, c, 0.5), (a, b, 0.5) 


Note that further answers for path have a score not greater than the threshold 
pr =a 


Proposition 93 ([428]). Given a knowledge base, the generalized threshold- 
based method correctly reports the top-k results ordered by the score. 


We refer the reader to [428] for further insights on the top-k retrieval algorithm 
and for the computational complexity analysis of the TopAnswers procedure. 
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Appendix A 


RDFS Semantics and Inference Rules 


For the sake of completeness, we reproduce here the definitions and axioms of 
the normative semantics of RDFS [307, 362] consisting of a model theory and 
axiomatic triples. We also recap a complete inference rule set. 

The set rdfsV stands for the RDFS vocabulary and is illustrated in Ta- 
ble A.1 with the shortcuts we use in brackets. The first column shows built-in 
classes, and the second and third show built-in properties. 

An interpretation T over a vocabulary V is a tuple 


Z = (Ar, Ap, Ac, Az, PE], CLI, 7) > 


where Ar, Ap, Ac, Ay are the interpretation domains of Z, which are finite 
non-empty sets, and Pl], C[-],-7 are the interpretation functions of Z. They 
have to satisfy: 


1. Ar are the resources (the domain or universe of T); 

2. Ap are property names (not necessarily disjoint from Ap); 
3. AcC Ar are the classes; 

4. Ap C Ap are the literal values and contains LN V; 

5 


. PĮ] is a function P|]: Ap > 24rx4r; 


TABLE A.1: RDFS vocabulary. 


rdfs: Resource [res] rdf:type [type] rdfs:isDefinedBy [isDefined] 
rdf:Property [prop] rdfs:domain [dom] rdfs:comment [comment] 
rdfs:Class [class] rdfs:range [range] rdfs:label [label] 
rdfs:Literal [literal] rdfs:subClassOf [sc] rdf:value [value] 
rdfs:Datatype [datatype] rdfs:subPropertyOf [sp] rdf:nil [nil] 
rdf:XMLLiteral [xmlLit]  rdf:subject [subj] rdf:_1 [1] 
rdfs:Container [cont] rdf:predicate [pred] rdf: 2 [2] 

rdf:Statement [stat] rdf:object [obj] 2i 

rdf:List [list] rdfs:member [member]  rdf:_i [-i] 

rdf:Alt [alt] rdf:first [first] 

rdf:Bag [bag] rdf:rest [rest] 

rdf:Seq [seq] rdfs:seeAlso [seeAlso] 


rdfs:ContainerMembershipProperty [contMP] 


269 


270 


TABLE A.2: RDFS axiomatic triples. 


type, type, prop) 
first, type, prop) 

2, type, prop) 

nil, type, list) 

sp, dom, prop) 

obj, dom, stat) 
seeAlso, dom, res) 
value, dom, res) 

sp, range, prop) 

obj, range, res) 
isDefined, range, res) 
alt, sc, cont) 
isDefined, sp, seeAlso) 
_1, type, contMP) 
-2, dom, res) 


subj, type, prop) 
rest, type, prop) 


( 

( 

(ty oe, dom, res) 
(sc, dom , class) 
(member, dom, res) 
(isDefined, dom, res) 
(type, range, class) 
(sc, range, class) 
(member, range, res) 
(comment, range, literal) 
(bag, sc, cont) 

(xmlLit, type, datatype) 
(1, dom, res) 

(2, range, res) 


6. C[:] is a function C]-]: Ac > DAR. 


pred, type, prop) 
value, type, prop) 


dom, dom, prop) 
subj, dom, stat) 
first, dom, list) 
comment, dom, res) 
dom, range, class) 
subj, range, res) 
first, range, res) 
label, range, literal) 
seq, sc, cont) 
xmlLit, sc, literal) 
_1, range, res) 
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obj, type, prop) 
1, type, prop) 


( 

(= 

(range, dom, prop) 
(pred, dom stat) 
(rest, dom list) 
(label, dom , res) 
(range, range, class) 
(pred, range, res) 
(seeAlso, range, res) 
(value, range, res) 
(contMP, sc, prop) 
(datatype, sc, class) 
(2, type, contMP) 


7. Y maps each t € ULN V into a value t? € Apr U Ap, and such that -7 
is the identity for plain literals and assigns an element in Ar to each 


element in L. 


An interpretation Z is a model of a ground graph G, denoted Z 


= G, if 


and only if Z is an interpretation over the vocabulary rdfsV U universe(G) 
that satisfies the RDFS axiomatic triples [307, 362] (see Table A.2) and the 
semantic conditions as in Table A.3. 

A ground graph G entails a ground graph H if and only if any model of 
G is also a model of H. A graph G entails a graph H, denoted G = H, if 
and only if for any grounding G’ of G there is a grounding H’ of H such that 
GC EH”. 

Next, we provide a sound and complete deductive system for RDFS. The 
system is arranged in groups of rules that capture the semantic conditions 
of models. In every rule, A, B,C, X, and Y are meta-variables representing 
elements in UBL and D, E represent elements in UL. The rules are described 
below. 


1. Simple: 
(a) & fora map p:G! >G 


(b) g for G Cc G 

2. Subproperty: 
A,sp,B),(B,sp,C 
(a) ae 


D,sp,E),(X,D,Y 
(6) PRA 


( ) (A,type,contMP) 
iz (A,sp,member) 
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TABLE A.3: Semantic conditions for classical RDFS interpretations. 
Simple: 1. for each (s,p,0) € G, p? € Ap and (s*, 07) € P[p7]; 


RDF: 


= 


. p € Ap if and only if (p, prop?) € Pltype?]; 
2. if l € universe(G) is a typed XML literal with lexical form w then 

IZ is the XML literal value of w, I? € Az and (17, xmlLit?) € Pltype?]; 
RDFS: . x € Ap if and only if x € C[res7]; 
. x € Ac if and only if x € C[class*]; 
x € Ar if and only if x € Clliteral7]; 
. if c € Ac then (c, res?) € P[sc*]; 
„if x € Cldatatype?] then (a, literal?) € P[sc7]; 
„if x € C[contMP7] then (a, member?) € P[sp7]; 


PAPP» 


Subproperty: 1.P [sp*] is reflexive and transitive over Ap; 
2. if (p,q) € P[sp*] then p,q € Ap and Plp] € Pla]; 


Subclass: 1. P[sc?] is reflexive and transitive over Ac; 
2. if (c,d) € P[scZ] then c,d € Ag and C[c] € C[d]; 


Typing: 1. x € Cle] if and only if (x,c) € Pltype?]; 
2. if (p,c) € P[dom*] and (x,y) € Plp] then æ € C[c]; 
3. if (p,c) € Plrange?] and (x,y) € Plp] then y € C[c]; 


3. Subclass: 


(A,sc,B),(B,sc,C) 
(a) (A,sc,C) 


A,sc,B),(X,type,A 
O o 


(A,type,class) 
(c) (A,sc,res) 


(d) (A,type,datatype) 
(A,sc, literal) 


4. Typing: 


(D,dom,B),(X,D,Y) 
(a) (X, type, B) 


D,range,B),(X,D,Y 
(b) eal 


(A,D,B) 
(c) (D,type,prop) 


(A,D,B) 
(d) (A,type,res) 


(le) apezi if | € universe(G) is a typed XML literal 
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5. Implicit Typing: 


(A,dom,B),(D,sp,A),(X,D,Y) 
(a) (X,type, B) 


(b) (A,range,B),(D,sp,A),(X,D,Y) 
(Y, type, B) 


6. Subproperty Reflexivity: 


(A,type,prop) 
(A,sp,A) 


7. Subclass Reflexivity: 


(A,type,class) 
(A,sc, A) 


A proof is defined as follows. Let G and H be RDFS graphs. Then G HEPES H 
iff there is a sequence of graphs P;,,..., Pg with Pi = GU {r | 7 is an RDFS 
axiom as in Table A.2 } and P, = H, and for each j (2 < j < k) one of the 
following holds: 


1. there exists a map u : P; > P;-1 (rule (1a)); 
2. P; E Pj- (rule (1b)); 


3. there is an instantiation i of one of the rules (2)(5), such that R C P;_1 
and P; = yess) U R. 


The sequence of rules used at each step (plus its instantiation or map), is 
called a proof of H from G. Now it can be shown that 


Proposition 94 (Soundness and Completeness [307]). For RDFS graphs G 
and H, 
G rors H if and only if GEPPES H. 


Moreover, if G HEPFS H then there is a proof of H from G where the rule 
(la) is used at most once and at the end. 


Appendix B 


From OWL to Description Logics 


B.1 The Case of OWL 2 


The syntax of OWL 2 expressions and the mapping to SROTQ(D) is 
essentially as follows. In the following, A is an atom (unary predicate), R is 
a role (binary predicate), RT is the inverse of role R, S is a simple role, T 
is a datatype property, a,b are individuals, and l is a literal of the concrete 
domain supported by OWL 2. Symbols may have a subscript. 

An object property R may also be the top relation U or the bottom relation 
1, with obvious extension UZ = A? x Al and 1,2 = 0, respectively, for any 
interpretation Z. 


Assertion axioms. Assertion axioms are of the form: 


Samelndividual(a, b) — a = b. Note that in Section 4.2.2.3 we used UNA instead. 
DifferentIndividuals(a, b) > a 4 b 
ClassAssertion(C, a) > a:C 
ObjectPropertyAssertion(R, a,b) > (a, b):R 
NegativeObjectPropertyAssertion(R, a, b) > 
DataPropertyAssertion(T, a,l) > (a, 1):T 
NegativeDataPropertyAssertion(T, a, l) + 


Class Expressions. We recap here briefly the syntax of OWL 2 class ex- 
pressions. 
C —> A | CnC || C1 UC2 | aC | {a} | 
R.C | YR.C | 3R.{a} | 3S.Self | 
>n 8.0) | (< n S.C) | (= n S.C) | 
T.d | YT.d | IT. =,] 
> n T.d) | (< n T.d) | (= n T.d) 


l 
l 


Class Axioms. The class axioms of OWL 2 are of the following form 


SubClassOf (C1, Ca) > Ci L Ca 
EquivalentClasses(C1, C2) — C1 = Ca 
DisjointClasses(C1, 02) > disj(C1, C2) 
DisjointUnion(C, C1, C2) + C = Cy U C2, disj(C1, C2) 
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Object Property Axioms. The object property axioms of OWL 2 are of 
the following form 


SubObjectPropertyOf(Ri, R2,..., Rn, R) > Ri...R, ER 
EquivalentObjectProperties( R1, Ra) + Ri E Ra, Ra E Ri 
DisjointObjectProperties(S1, S2) ++ disj(S1, S2) 
InverseObjectProperties( R1, R2) + EquivalentObjectProperties( R1, R3 ) 
ObjectPropertyDomain(R, C) > dom(R, C) 

ObjectPropertyRange(R, C) > ran(R, C) 

FunctionalObjectProperty(S) — fun(R) 
InverseFunctionalObjectProperty(S) > fun( R7) 
ReflexiveObjectProperty(R) > ref(R) 

IrreflexiveObjectProperty(S) > irr(R) 

SymmetricObjectProperty(R) + sym(R) 

AsymmetricObjectProperty(S) — asy(R) 
TransitiveObjectProperty(R) > trans(R) 


Data Property Axioms. The data property axioms of OWL 2 are of the 
following form 


SubDataPropertyOf(T), T2) > T E Ta 
EquivalentDataProperties(T), Ta) > Ti E Ta, Ta ET; 
DisjointDataProperties(T1, T2) > disj(T1, Ta) 
DataPropertyDomain(T, C) > dom(T, C) 
DataPropertyRange(T, d) > ran(R, d) 
FunctionalDataProperty(T) > fun(T) 


Datatypes. We refer the reader to http://www.w3.org/TR/2008/WD-owl2- 
syntax-20081008/#Datatype_Maps concerning the datatypes supported by 
OWL 2. 


Simple roles. An object property R is composite w.r.t. a set of object prop- 
erty axioms FR iff 


1. Ris either T, or Lp; or 

2. R contains an axiom of the form 
(a) SubObjectPropertyOf(Ri, R2,..., Rn, R); 
(b) SubObjectPropertyOf(R1, Ro,...,Rn, R7); 
(c) TransitiveObjectProperty(R); 
(d) TransitiveObjectProperty(R7 ); 


The relation — w.r.t. R is the smallest relation for which the following con- 
ditions hold: 
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1. If R contains SubObjectPropertyOf(R1, Ra), then Ri > Ra; 


2. If R contains EquivalentObjectProperties(Ri, Ro), then Ri > Rz and 
Ra > Ra; 


3. If R contains InverseObjectProperties(R¡, R2), then Ry > R3 and Ry > 
Ra; 


4. If R contains SymmetricObjectProperty(R), then R > R7; 
5. If Ry > Ra then Ri => Ry. 


Let —* be the reflexive-transitive closure of >. 

An object property expression R is simple w.r.t. R if, for each object 
property R’ such that R’ —* R holds, R’ is not composite. 

We also assume that no SubObjectPropertyOf axiom contains the uni- 
versal role U property (similarly, no SubDataPropertyOf axiom contains the 
universal datatype property) and that R is regular (see Section 4.2.2.3). 


B.2 The Case of OWL QL 


The syntax of OWL QL expressions and the mapping to DL-Lite is es- 
sentially as follows. In the following, A is an atom (unary predicate), R is a 
role (binary predicate) and R” is the inverse of role R. Symbols may have a 
subscript. The mapping to DL-Liter is derived from the one for OWL 2 (see 
Section B.1). 


Assertions. Assertion axioms are of the form: 


ClassAssertion(A, a) 
ObjectPropertyAssertion(R, a, b) 


Class Expressions. In OWL 2 QL, there are two types of class expressions. 
The B sub class production defines the class expressions that can occur as sub 
class expressions in SubClassOf axioms, and the C super class production 
defines the classes that can occur as super class expressions in SubClassO f 
axioms. Their syntax is: 


B — A|3R 


Class Axioms. The class axioms of OWL 2 QL are of the following form. 


SubClassOf(B, C') 
EquivalentClasses( B1, B2) 
DisjointClasses( B1, B2) 
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Property Axioms. The property axioms of OWL 2 QL are of the following 
form 


SubObjectPropertyOf(R1, Ra) 
EquivalentObjectProperties(R,, R2) 
DisjointObjectProperties( R1, Ra) 
InverseObjectProperties( R1, R2) 
ObjectPropertyDomain(R, C) 
ObjectPropertyRange( R, C) 
ReflexiveObjectProperty(R) 
SymmetricObjectProperty(R) 
AsymmetricObjectProperty(R) 


Note that there are some differences among DL-Liter and OWL 2 QL: 


1. existential quantification to a class AR.C; 
2. symmetric property axioms and asymmetric property axioms; 
3. reflexive property axioms and irreflexive property axioms. 


Notice that, although 1 and 2 are not natively supported by DL-Liter, they 
are actually expressible in DL-Liter by suitably processing the intensional 
level of the ontology. Conversely, reflexivity and irreflexivity axioms are brand 
new features and require new inference mechanisms. 


Native handling of qualified existential quantification. It is well known 
that an axiom 7 of the form 


BCARA 


can be replaced with the transformation 


Ether {B = JRauz, Rauz = R, JR A} > 


aut — 


where Raus is a new role. We point out that [100] provides also a query 
reformulation method that does not require the translation. 


Native handling of symmetric and asymmetric object property ax- 
ioms. 


1. each SymmetricObjectProperty(R) is managed by adding R E R- to the 
DL component; 


2. each AsymmetricObjectProperty(R) is managed by adding R E ~R to 
the DL component; 


Handling of reflexive and irreflexive object property axioms. It has 
been observed in [100] that since (i) the asymmetry axiom on a property 
implies the irreflexivity axiom on the same property; and (ii) the asymmetry 
axiom influences only the consistency check on the ontology; the irreflexivity 
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axiom influences only the consistency check, too. [100] shows how to deal with 
irreflexivity axioms in the consistency checking phase. 

On the other hand, reflexive role declarations impact the query reformu- 
lation phase, cannot be represented in DL-Liter, and they have to be dealt 
with specifically in the reformulation algorithm, though the solution is easy. 
Essentially, a reflexive role R means that R(x,xw) is always true, we have 
to appropriately drop any occurrence of R(x,x) occurring in a query dur- 
ing the query reformulation procedure. We refer the reader to [100] for the 
details. 


B.3 The Case of OWL EL 


We recap here briefly the syntax of OWL EL class expressions. In the 
following, A is an atom (unary predicate), R is a role (binary predicate), S is 
a simple role, and T is a datatype property. Symbols may have a subscript. 

The mapping to EL**(D) is derived from the one for OWL 2 (see Sec- 
tion B.1). 


Assertions.. Assertion axioms are of the form: 


ClassAssertion(A, a) 
ObjectPropertyAssertion(R, a, b) 


Class Expressions. OWL 2 EL class expression are as follows: 


C — A|C&NC | {a} |3R.C | IR.{a} | 39.Self 
3T.d | 


Class Axioms. The class axioms of OWL 2 EL are of the following form 


SubClassOf (C1, C2) 
EquivalentClasses(C4 , C2) 
DisjointClasses(C1, C2) 


Property Axioms. The property axioms of OWL 2 QL are of the following 
form 


SubObjectPropertyOf(R1, Ra) 
EquivalentObjectProperties( R1, Ra) 
ObjectPropertyDomain(R, C) 
ObjectPropertyRange( R, C) 
ReflexiveObjectProperty( R) 
TransitiveObjectProperty( R) 
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B.4 The Case of OWL RL 


We recap here briefly the syntax of OWL RL expressions. In the following, 
A is an atom (unary predicate), R is a role (binary predicate), S is a simple 
role, and T is a datatype property. Symbols may have a subscript. 

The mapping to Horn-DL is derived from the one for OWL 2 (see Sec- 
tion B.1). 


Assertions. Assertion axioms are of the form: 


ClassAssertion(C, a) 
ObjectPropertyAssertion(R, a, b) 


Class Expressions. In OWL 2 RL, there are two types of class expressions. 
The sub class B production defines the class expressions that can occur as sub 
class expressions in SubClassOf axioms, and the C super class production 
defines the classes that can occur as super class expressions in SubClassO f 
axioms. The D class production defines the classes that can occur in concept 
definitions. Their syntax is (m € {0,1}): 


B — Ala] BB | BB | ARB aR a ma 37 = 

C — A|C¡MCz|-B|VR.C | 3R4a) | YT.d | IT. =| 
(SmS B) | (<m8)|(<mTd) 

D => AR.{a} | AT. =| Dı M Do 

R — P| 


Class Axioms. The class axioms of OWL 2 RL are of the following form 


SubClassOf (B, C) 
EquivalentClasses( A, D) 
DisjointClasses( B1, B2) 


Property Axioms. The property axioms of OWL 2 QL are of the following 


form 


SubObjectPropertyOf(R,, Ra) 
EquivalentObjectProperties( R1, Ra) 
DisjointObjectProperties(S1, S2) 
InverseObjectProperties( Ra, Ra) 
ObjectPropertyDomain(R, C) 
ObjectPropertyRange(R, C) 
FunctionalObjectProperty(S) 
InverseFunctionalObjectProperty (5) 
IrreflexiveObjectProperty(S) 
SymmetricObjectProperty(R) 
AsymmetricObjectProperty(S) 
TransitiveObjectProperty(R) 


Appendix C 


A Tableau Calculus for SHIF, 


The major issue introduced by functional roles is that together with inverse 
roles they may cause a concept to be satisfiable in an infinite model only. 


Example 60. Consider the following KB K = (R,T, A) with 


R =p 
T = {TTSA} 
A = {anA}, 


where the inverse of the role S is functional, i.e., fun( S7). Then, K has a 
model, but no finite one. Indeed, a model T of K is 


AZ = N 

a? = 1 

1 g AT 

n € A? foralln>1 


(n,n+1) e ST foralln>1. 


This property is well-known for SHZF,. In order to cope with this issue, clas- 
sical DLs have proposed the so-called notion of pairwise blocking to guarantee 
the correct termination of a tableau from which then we may build a possibly 
infinite model (see, e.g., [209]). 

With respect to ALC, we need to extend some notions. The set of roles 
is the set of roles and their inverse. To avoid considering roles such as R7~, 
we define a function Inv such that Inv(R) = RT if R is an atomic role, and 
Inv(R) = Ru if R = R] . We also define a function Trans(R) to return true iff 
R is a transitive role, i.e., trans(R) or trans(Inv(R)) (note that R is transitive 
iff Inv(R) is transitive). 

We start with considering role axioms and role inclusion axioms. Given an 
RBox R, we define the sub-role relation ER as the transitive-reflexive closure 
of E over R’ = RU {Inv(R1) E Inv( Re) | Ri E R2 E R}, that is 


1. if R occurs in R’ then R E} R; 


2. if Ry E Ra ER’ then Ri ER Ra; 


3. if Ry ER€ R’ and Ro E R3 € R’ then Ri ER Rz. 
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We extend an interpretation Z to interpret Ri CR Ro as Z > Ri ER Ra 
iff Z E Ri E Rə. It is then easy to see that K | Ri CR Ra iff Ri CR Ro. 
This provides us a simple procedure to determine whether K = Ri CR Re. 

Let K be a SHZTF, KB. Recall SHZF, differs from SHZF as there are no 
local functional roles (< 1 R) occurring in K, but there are global functional 
roles fun(R) only. 

Now, a completion-forest F for K is a collection of trees whose distin- 
guished roots are arbitrarily connected by edges. Each node v is labelled with 
a set L(v) of concepts C € sub(K). The intuition here is that v is an instance 
of C. Each edge (v, w) is labelled with a set L((v, w)) of roles R occurring in 
K indicating that (v, w) and instance of R. 

If nodes v and w are connected by an edge (v, w) then w is called a successor 
of v and v is called a predecessor of w. Ancestor is the transitive closure of 
predecessor, and descendant is the transitive closure of successor. 

If nodes v and w are connected by an edge (v, w) with R’ € L((v, w)) and 
R' CR R, then w is called an R-successor of v and v is called an R-predecessor 
of w. If node w is an R-successor of v or an Inv(R)-predecessor of v, then w 
is called an R-neighbor of v. 

For a node v, £(v) is said to contain a clash iff [A, =A} C Liv). A 
completion-forest is called clash-free iff none of its nodes contains a clash; 
it is called complete iff none of the expansion rules in Table C.1 is applicable. 

Now, the algorithm initializes a forest F as follows: 


e F contains a root node vå, for each individual a; occurring in A; 


e F contains an edge (vj, vA), for each assertion axiom (a;,a;):R € A; 


e for each assertion a;:C € A, we add C to Lug) 


e for each (a;,a;):R € A, we add R to L((vj¿, vj)). 


We also need a technical definition involving functional roles (see [295]). Let F 
be forest, R a functional role such that we have two edges (v, w1) and (w, wa) 
such that R occurs in L((v,wi)) and L((v,we)), respectively. Then we call 
such a pair a fork. As R is functional, such a fork means that w and wa have 
to be interpreted as the same individual. Such a fork can be deleted by adding 
both L((v, w2)) to L((v,wi)), and £(w2) to L(w1), and then deleting node 
wə. Of course, as inverse roles are allowed these may also contribute to create 
a fork, i.e., we have a fork if w and wa are R-neighbors of v. 

We assume that forks are eliminated as soon as they appear (as part of a 
rule application) with the proviso that newly generated nodes are replaced by 
older ones and not vice versa. 

At the beginning, we check that there are no forks in the initial forest, 
otherwise the KB is not satisfiable (due to the unique name assumption, if we 
have (a, 6,):R and (a, b2):R with bi 4 bz then R cannot be functional). 

As next we define the notion of pairwise blocking (see Figure C.1). A node 
v is blocked iff it is not a root node and it is either directly blocked or indirectly 
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L(<vv>) = L(<w',w>) 


w' L(w) L(v) = L(w) 
L(<w',w>) L(v) = L(w) 
w L(w) 
v Lv) 
L(<v'v>) 
MITO 


FIGURE C.1: Pairwise blocking in SHZF. 


blocked. A node v is directly blocked iff none of its ancestors are blocked, and 
it has ancestors v’, w, w’ such that 


1. wis not a root node, 
2. visa successor of v”, 
3. w is a successor of w”, 
4. Lw) = L(w), 

5. L(v’) = L(w”), 

6. L((v”,0)) = L( (w, w)). 


The set of reasoning rules are shown in Table C.1. 

Note that a model Z from a complete and clash-free completion-forest can 
be built as follows [206]. The major problem now is that the model may be 
infinite. The main idea is that an individual in the domain A? intuitively 
corresponds to a path, in a complete completion-forest F, from a root node 
to some node that is not blocked, and that goes only through non-root nodes. 
To obtain an infinite model, these paths may be cyclic. Instead of going to 
a direct blocked node, these path go “back” to the blocking node and this 
an infinite number of times. Thus, if blocking occurred while constructing a 
complete and clash-free completion-forest, we obtain an infinite domain. More 
precisely, a path is of the form 


vo Un 


A E 
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TABLE C.1: The tableau rules for SHTF,. 


— 


). If (1) CIN Ca € Liv), (îi) {C1,C2} Z L(v) and (iii) node v is not 
indirectly blocked, then add C and Cə to L(v). 

). If (i) Cr U Ca e Llv), (îi) {C1, Co} N L(v) = Ý and (iii) node v is not 
indirectly blocked, then add some C € {C1, C2} to L(v). 

(V). If (i) VR.C € Lv), (îi) there is an R-neighbour w of v with C ¢ L(w) 
and (iii) node v is not indirectly blocked, then add C to L(w). 

3). If i) 3R.C € Liv), (ii) there is no R-neighbour w of v with C € L(w) 
and (iii) node v is not blocked, then create a new node w, add R to 
£((v, w)) and add C to L(w). 

(Vi). If (i) VS.C € Liv), (îi) there is some R with Trans(R) and R Ej S, 

(iii) there is an R-neighbour w of v with VR.C g L(w) and (iii) node v 
is not indirectly blocked, then add VR.C to L(w). 
(O). If @) TEDET, (ii) D g L(v) and (ii) node v is not indirectly blocked, 
then add D to L(v). 


— 


m 


where v;,v; are nodes of F. For such a path, we define Tail(p) = v,. With 
[p | +] we denote the path Ez „up 2, p++]. The set of paths Path(F) is 
n+1 


y) ore g 
Vn’ Unga 


defined inductively as follows: 
1. for root nodes vj in F, ES € Path(F); 
(0) 
2. for a path p € Path(F) and a node win F 


(a) if w is a successor of Tail(p) and w is neither blocked nor a root 
node, then [p | 2] € Path(F); or 


(b) if for some node v in F, v is a successor of Tail(p) and w blocks v, 
then [p | 7] € Path(F). 


Please note that, due to the construction of Path( F), for p € Path(F) with 
p = |p’ | 7], we have that w is not blocked, v is blocked iff w 4 v, and v is 
never indirectly blocked. Furthermore, £(w) = L(v). 

Now, let F be a complete and clash-free completion-forest constructed by 
the tableaux algorithm for K. 


e The domain of Z are the paths of the forest, 1.e., 
A7 = Path(F) . 
e The interpretation of an individual a; occurring in K is as follows: 


i 
a? = [0]. 
vo 
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e TI=AT, 17 = 9. 
e The interpretation of an atom A is as follows: 
At = {p € AF | v=Tail(p), A € L(v)} . 
e The interpretation of a role R is as follows: 
RE = {(p,q) € Path(F) x Path(F) | 


U 
q = |p| —] and v’ is a R-successor of Tail(p), or 
U 


p=Íq | 2] and v' is a Inv(R)-successor of Tail(q)} U 
v 


(2), [—]) | v,w are root nodes and w is an R-neighbor of v} . 
v 


It can be shown that: 
Proposition 95 ([209]). For each SHTF, knowledge base K 


1. the tableau algorithm terminates; 


2. if the expansion rules can be applied in such a way that they yield a 
complete and clash-free completion-forest, then K has a model; 


3. if K has a model, then the expansion rules can be applied in such a way 
that they yield complete and clash-free completion-forest for K; 


4. the KB satisfiability problem is NExpTime-complete [442]. 


Example 61. Consider the KB in Example 60. K is satisfiable as there is a 
clash-free complete completion-forest (see Figure C.2). Note that nodes x,y, 
and z form a pairwise blocking condition, with w =2,w=y=v' andv =z. 


FIGURE C.2: Clash-free complete completion-forest for SHTF KB. 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


Appendix D 


From RIF-Core to Datalog 


In this appendix, we provide a mapping of a significant subset of RIF-Core 
statements into Datalog with concrete domains. 


Syntax. To start with, a term t and atoms are defined as follows: 
1. a constant is a term; 
2. a variable is a term. 


An atomic formula is of the form p(t1,...,t,), where t; are terms. A formula 
@ has the following grammar (where A is an atom, p is a predicate symbol, 
and t; are terms) 


ọ > A | (atom) 
gr $2 | (conjunction) 
bi Vf | (disjunction) 
Ext(p(ti,...,tn)) | (external atom) 
Jr. (existential formula) . 


A fact is an atom (not necessarily ground), while a rule is of the form (n > 1) 


where A; are atoms and ¢ is a formula. 

Note that external atoms are used for representing built-in predicates as 
“procedurally attached” predicates, which might exist in various rule-based 
systems, but are not specified by RIF. 

For a rule AyA...A An + ¢, A1A...AA,, is called, head, while ¢ is called 
the body of the rule. Rules have to be safe, that is any variable occurring in 
an external term or atom have to also occur in the same rule body within a 
non external atom. Rules and facts are considered as universally quantified. 
In a fact or rule, the universally quantified variables are called distinguished 
variables, while the existentially quantified variables in a rule body are the 
so-called non-distinguished variables and are distinct from the distinguished 
variables (note that there are non existential quantifications in facts). 

A RIF-Core knowledge base is a pair K = (F,P), where F is a finite set 
of facts and P is a finite set of rules. 
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Remark 48. Let us note that RIF-Core [365] also provides some other forms 
of atoms, such as membership, equality, and frame atoms, which are however 
peculiar features of F-Logic and do not strictly belong to the Datalog realm as 
F-Logic is a Higher-Order Language. We refer the interested reader to [237, 
288]. Examples of such atoms are 


1. Frame atoms: johnlage — x, spouse — y], dictating that “the age of 
john is x, while his spouse is y”. We may well use variables in place of 
the attributes (predicate) age and spouse and the constant john, which 
makes RIF-Core a higher order logic; 


2. Membership term: john#Person, dictating that “john is a Person”. We 
may well use a variables in place of the constant john and the unary 
predicate symbol Person. 


Eventually, RIF-Core also employs “lists” as built-in terms and external func- 
tion calls. Specifically 


e if ti,...,t, are terms, then List(ti,...,tn) is a so-called closed list 
term; 
e if f is a function symbol, ti,...,t, are constants or variables then 


Ext(f(t1,...,tn)) is an externally defined term. 


We do not address these types of terms and atoms here. Note that such external 
terms can be simulated anyway via an external atom Ext(ps(t1,...,tn), £) 
for an appropriate external predicate pp where x will hold the result of the 
application of f to the arguments ty,...,tn.- 


Semantics. The semantics extend the one for Datalog with concrete domains 
(see Sections 5.2 and 5.3). Specifically, let K = (F,P) be a knowledge base 
and consider a finite datatype theory D = (AP, -?) including the support for 
all externally defined atoms occurring in K. 

The Herbrand universe Hx is the set of all constants occurring in K to- 
gether with the values in AP. The Herbrand base Bx of K is the set of all 
ground atoms that can be formed using constants in Ax and atoms occurring 
in K. 

The set of ground rules K* derived from the grounding of P is constructed 
as follows (we extend the language by allowing the ground atomic formula 1, 
denoting the truth true, to occur in the rule body): 


1. set K* to be 


K := {p(c) — 1 | p(c) € F}U {p(c) — 1 | p(x) € F,c € Ak} ; 


2. add to K* the set of all ground instantiations of rules in K, i.e., rules 
obtained by replacing distinguished and non-distinguished variables with 
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elements from the Herbrand universe Hx and from which we remove 
the existential quantification expression of the form 3x. Note that rule 
bodies of rules in K* are Boolean propositional formulae. 


An interpretation T for K is a subset of Be. Given K, we say that T satisfies 
(is a model of) 


e the ground atom 1, denoted Z = 1; 


e a ground atom A € Bx, denoted T = A, iff A € T; 

e a ground formula ¢, denoted Z = ¢ iff (inductively) 
1. ZEAif dis an atom A; 
2. T | ¢, and T HE po if ¢ is a conjunction ¢; A a; 
3. TF 0 or T E ¢ if ¢ is a disjunction ¢) V Pa; 
4. t € pP if ¢ is an external atom Ext(p(t)); 


e a ground rule r € K*, denoted Z [E r, iff Z is a model of the head of r 
whenever Z is a model of the body of r; 


e K*, denoted T |= K*, iff Z satisfies all rules r € K*; 
e K, denoted T E K, if Z is a model of K*. 


Let Ix = 2P* be the set of all interpretations (there are 2/PX! many). Now, 
for 71, Za € Ix, we write Zi < I iff Zi C Ta. It is easy to see that (Ix, <) is a 
finite complete lattice. 

The minimal model of K, denoted Mx, is the <-least model of K. The 
existence, finiteness, and uniqueness of the minimal model Mx is guaranteed 
to exist using the same argument as for Datalog (see Proposition 16). 


Mapping. The mapping of our RIF-Core sublanguage to Datalog is as follows. 
Facts are mapped as follows: 


ple) => plc) 
p(x) => plc) for all c € Hk. 


Concerning rules, consider a rule A; A...A A, + 6. First of all, we transform 
the rule body ¢ into a Prenez Disjunctive Normal Form (PDNF), which is of 
the form 


dx.(¢@1 V... V dr) j 


where each ¢; is a conjunction of atoms or external atoms. To do so, we apply 
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recursively the following rules (we assume that x 4 y)! 


32.b1 Ady.d2 +> 313ylb1 A 2) 

31.1 Adx.pa + Aaxdy.(o1 A ¢2[2/y]), where y new variable 

32.b1 VIy.p2 +> IrIy.(ı V $2) 

31.p1 Vx. da + Aady.(do1 V P2[2/y]), where y new variable 
3x.(01 A(d2 V $3)) = 3Ix.((01 A 02) V (01 A b3)) 


Now, consider a rule A; /A...AA, < ¢, where ¢ is in PDNF, we may eventually 
map it into a set of Datalog rules by applying the following two rules: 
AI1MN...AAnTO > Ar 90,... Anto 
A 3IX(01VW...VOr) > At 01... AC be. 


This concludes the mapping. 


"With ó[x/y] we denote, as usual, the formula obtained from ¢ by replacing all free 
occurrences of x with y (see, e.g., [93]). We also omit the rules obtained by commutativity 
of A and V. 


Appendix E 


Basic Logics to Deal with Uncertain 
Statements 


The aim of this section is to illustrate typical logics to deal with uncertainty, 
i.e., we illustrate a typical formalization of uncertain statements via a prob- 
abilistic and a possibilistic setting. Reasoning procedures are presented as 
well. 


E.1 Probabilistic Logic 


Probabilistic logic has its origin in philosophy and logic. Its roots can 
be traced back to Boole in 1854 [66]. There is a wide spectrum of formal 
languages that have been explored in probabilistic logic, ranging from con- 
straints for unconditional and conditional events to rich languages that spec- 
ify linear inequalities over events (see especially the work by Nilsson [332], 
Fagin et al. [151], Dubois and Prade et al. [8, 133, 134, 135], Frisch and Had- 
dawy [162], and [275, 276, 284]; see also the survey on sentential probability 
logic by Hailperin [185]). Recently, non-monotonic generalizations of proba- 
bilistic logic have been developed and explored; see especially [277] for an 
overview. In this section, for illustrative purposes, we recall only the simple 
probabilistic logic described in [332]. 

We first define probabilistic formulas and probabilistic knowledge bases. 
We assume a set of basic events ® = {p1,..., Pn} with n> 1. We use L and T 
to denote false and true, respectively. We define events by induction as follows. 
Every element of ®U{1,T} is an event. If d and y are events, then also 7d, 
(PAW), (9 V Y), and (4 > y) are events. We adopt the usual conventions to 
eliminate parentheses. 

A probabilistic formula is an expression of the form (¢,n), where q is 
an event, and n is a real number from the unit interval [0,1]. Informally, 
(6,n) says that ¢ is true with a probability of at least n. For example, 
(rain tomorrow, 0.7) may express that it will rain tomorrow with a proba- 
bility of at least 0.7. Notice also that (0,1 — n) encodes that ¢ is true with 
a probability of at most u. A probabilistic knowledge base K is a finite set of 
probabilistic formulas. 
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We next define worlds and probabilistic interpretations. A world I 
associates with every basic event in ® a binary truth value. We extend I 
by induction to all events as usual. We denote by Za the (finite) set of all 
worlds for ®. 

A world I satisfies an event ¢, or I is a model of ¢, denoted IE 4, iff 
I(¢) = true. 

A probabilistic interpretation Pr is a probability function on Za (that is, 
a mapping Pr: Ta — [0,1] such that all Pr() with I €Zẹ sum up to 1). 
Intuitively, Pr(I) is the degree to which the world I € Za is probable, that is, 
the probability function Pr encodes our “uncertainty” about which world is 
the right one, though, in any world a statement is either true or false. 

The probability of an event ¢ in Pr, denoted Pr(¢@), is the sum of all Pr(1) 
such that I € Ze and IE @. That is, 


Pr(¢) = X` Pr(). 


I=e 


The following proposition is an immediate consequence of the above defini- 
tions. 


Proposition 96. For all probabilistic interpretations Pr and events $ and y, 
the following relationships hold: 


Pr(9Ay) = Pr(9) + Pr(p) — Pr(9 V 4); 

Pr(9Ay) < min(Pr(9), Pr(v)); 

Pr(9AY) => max(0, Pr(6) + Pr(p) — 1); 

Pr(9vy) = Pr(9)+Pr(p) - Pr( Av); 

Prov) < min(1, Pr(¢) + Pr(v)); (E.1) 
Pr(9Vy) > max(Pr(¢), Pr(p)); 

PrOg) = 1-—Pr(p); 

Pr(1) = 0% 

Pr(T) =D 


A probabilistic interpretation Pr satisfies a probabilistic formula (4, n}, or 
Pr is a model of (¢,n), denoted Pr = (6, n), iff Pr(d) > n. We say Pr satisfies 
a probabilistic knowledge base K, or Pr is a model of K, iff Pr satisfies all 
FeEK. We say K is satisfiable iff a model of K exists. 

A probabilistic formula F is a logical consequence of K, denoted K EF, iff 
every model of K satisfies F. We say (¢,7) is a tight logical consequence of K 
iff n is the infimum of Pr(¢) subject to all models Pr of K. Notice that the 
latter is equivalent to n= sup {r| KE (¢,r)}. n is called the best entailment 
degree of p w.r.t. K (denoted bed(K, ¢)), i.e., 


bed(K, 6) = sup {r |K E (6,1), . 


The main decision and optimization problems in probabilistic logic are decid- 
ing the satisfiability of probabilistic knowledge bases and logical consequences 
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from probabilistic knowledge bases, as well as computing the best entailment 
degree from probabilistic knowledge bases, which can be done by deciding 
the solvability of a system of linear inequalities and by solving a linear op- 
timization problem, respectively. In particular, column generation techniques 
from operations research have been successfully used to solve large problem 
instances in probabilistic logic; see especially the work by Jaumard et al. [215] 
and Hansen et al. [191]. 

For the sake of completeness, we provide here a simple procedure to solve 
the KB satisfiability problem as well as the best entailment problem. In the 
following, we use variable y, as a placeholder for the value of Pr(I). Then 
a probabilistic knowledge base K has a model Pr iff the following system of 
linear constraints over the variables y, (J € Za), is solvable: 


for all yz, yr > 0 
Stine (E.2) 
for all (¢’,n) € K, Ip Yr >. 


Specifically, if y] is a solution to (E.2), then the probabilistic interpretation 
Pr(I) := yr (for all 1€ Za) is a probabilistic model of K. If no solution to 
(E.2) exists then K has no model. Note that (E.2) has exponentially as many 
variables yz, one for each possible classical interpretation T. 

Similarly, the best entailment degree bed(K, $) can be computed as 


minimize So TK YI subject to 


for all yr, yr >0 
il (E.3) 
for all (¢’,n) € K, rep Y n 
We point out that bed(K, 6) can be computed as well as 
bed(K, $) = min z. such that K U {(7¢, 1 — x) } satisfiable. (E.4) 


Informally, concerning the minimization in computing bed(K, $), suppose the 
minimal value of x is n. We will know then that for any interpretation Pr 
satisfying the knowledge base such that Pr(¢) < ñ, the starting set is not 
satisfiable (otherwise 7% wouldn't be minimal) and, thus, Pr(¢) > ñ has to 
hold. Which means that bed(K, 9) = n. 
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Therefore, we may rewrite (E.3) as follows: 


minimize x subject to 0<x<1 

for all yy, yr >0 
Dry =l (E.5) 
Veg Yr EZ 

for all (¢’,n) € K, Drege YI ZN 


In the above system we used the fact that 


S yr + Y yr=1 


I= Ile=6 
and, thus, the condition 
DE yr E=8 
IH 
is the same as 
5 yr Sr. 
16 


E.2 Possibilistic Logic 


We next recall possibilistic logic; see especially [140]. The main syntactic 
and semantic differences to probabilistic logic can be summarized as follows. 
Syntactically, rather than using probabilistic formulas to constrain the proba- 
bilities of propositional events, we now use possibilistic formulas to constrain 
the necessities and possibilities of propositional events. Semantically, rather 
than having probability distributions on worlds, each of which associates with 
every event a unique probability, we now have possibility distributions on 
worlds, each of which associates with every event a unique possibility and a 
unique necessity. Differently from the probability of an event, which is the sum 
of the probabilities of all worlds that satisfy that event, the possibility of an 
event is the maximum of the possibilities of all worlds that satisfy the event. 
As a consequence, probabilities and possibilities of events behave quite dif- 
ferently from each other (see Equations (E.1) and (E.6)). These fundamental 
semantic differences between probabilities and possibilities can also be used 
as the main criteria for using either probabilistic logic or possibilistic logic in 
a given application involving uncertainty. In addition, possibilistic logic may 
especially be used for encoding user preferences, since possibility measures can 
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actually be viewed as rankings (on worlds or also objects) along an ordinal 
scale. 

The semantic differences between probabilities and possibilities are also re- 
flected in the computational properties of possibilistic and probabilistic logic, 
since reasoning in probabilistic logic generally requires to solve linear optimi- 
sation problems, while reasoning in possibilistic logic does not, and thus can 
generally be done with less computational effort. Note that although possibil- 
ity measures can be viewed as sets of upper probability measures [141], and 
possibility and probability measures can be translated into each other [136], 
no translations are known between possibilistic and probabilistic knowledge 
bases as described here. 

We first define possibilistic formulas and knowledge bases. Possibilistic 
formulas have the form (P¢,n) or (Nd,n), where ¢ is an event, and n is a 
real number from [0,1]. Informally, such formulas encode to what extent ¢ 
is possibly resp. necessarily true. For example, (Prain_tomorrow, 0.7) encodes 
that it will rain tomorrow is possible to degree 0.7, while (Nfather — man, 1) 
says that a father is necessarily a man. A possibilistic knowledge base K is a 
finite set of possibilistic formulas. 

A possibilistic interpretation is a mapping 7: Ta — [0,1]. Intuitively, r(1) 
is the degree to which the world / is possible. In particular, every world I such 
that (1) =0 is impossible, while every world I such that (I) =1 is totally 
possible. We say m is normalized iff (I) =1 for some I € Za. Intuitively, this 
guarantees that there exists at least one world, which could be considered the 
real one. 

The possibility of an event ¢ in a possibilistic interpretation 7, denoted 
Poss((), is then defined by 


Poss(¢) = max (r(1)| 109) 


where max = 0. Intuitively, the possibility of ¢ is evaluated in the most pos- 
sible world where ¢ is true. The dual notion to the possibility of an event q 
is the necessity of ¢, denoted Nec(¢), which is defined by 


Nec(¢) = 1 — Poss(7¢) . 


It reflects the lack of possibility of —g, that is, Nec(6) evaluates to what extent 
¢ is certainly true. The following theorem follows immediately from the above 
definitions. 


Proposition 97. For all possibilistic interpretations m and events @ and y, 


294 Foundations of Fuzzy Logic and Semantic Web Languages 


the following relationships hold: 
Poss(pAy) <  min(Poss(9), Poss(y)); 


Poss(pV yv) = max(Poss(¢), Poss(y)); 
Poss(—9) = 1- Nec(9); 
Poss(L) = 0; 
Poss(T) = 1 (in the normalized case); 
(E.6) 
Nec(pAvj) = min(Nec(d), Nec(y)) ; 
Nec(pv y) 2 max(Nec(p), Nec(w)); 
Nec(7¢) = 1- Poss(¢); 
Nec(L) = 0 (in the normalized case); 
Nec(T) = da 


A possibilistic interpretation m satisfies a possibilistic formula (Pg, n) (resp., 
(Ng,n)), or m is a model of (Pd, n) (resp., (Ng,n)), denoted 7 H (Po, n) (resp., 
TE (Ng, n)), iff Poss(d) > n (resp., Nec(¢) > n). The notions of satisfiability, 
logical consequence, tight logical consequence and best entailment degree for 
possibilistic knowledge bases are then defined as usual (in the same way as in 
the probabilistic case). Specifically, we recall 


bed(K,P¢) = suplir | K =(Pó,r)) 
bed(K,Nd) = sup{r | K E (Nd,r)}. 


We refer the reader to [140, 199] for algorithms for possibilistic logic. However, 
for the sake of completeness, we provide here an analogous reasoning procedure 
as for the probabilistic case. 
Similarly to the probabilistic case (see Equation (E.4)), it can be shown 
that 
bed(K, Pp) = min z. such that KU (Pg < x} satisfiable, (E.7) 


where a possibility distribution m satisfies the expression Pg < x iff Poss(¢) < 
x. The problem above can be solved by the mixed integer linear programming 
problem (see [216], how the problem below converts to a mixed integer linear 
programming problem) with variables x,y; with I € Zo: 


min x. such that 0<x<1l 
for all J, 0<y<1l 
max{yr | 16) <a (8.8) 


for all (Pd’,n) EK,  max{yy|ITEd/}>n 


for all (Ng',n) e K, minl-y|lI4+49j>n. 
The case for bed(K, Nd) can be addressed similarly: 


bed(K, Nọ) = min z. such that K U {N¢ < x} satisfiable, (E.9) 
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where a possibility distribution 7 satisfies the expression Nọ < z iff Necc(¢) < 
x. Now, it suffices to change the third row in Equation (E.8) with 


min{l—y,|I Ep} <e. 
Of course, a possibilistic KB K is satisfiable iff the system 


for all J, O0<yr<l 
for all (Pd’,n) € K, maxtyr | I= p} >n (E.10) 


for all (NY n) €K, minf{l—y,|LAd}>n 


is satisfiable. Specifically, if y1 is a solution to (E.10), then the possibilistic 
interpretation (I) := y; (for all I € Za) is a possibilistic model of K. If no 
solution to (E.10) exists then K has no model. 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


Appendix F 


OR-based Inference Rules for P(D) 


We restrict our presentation to the case in which the truth space is Ln. So, 
let the truth space be L, and let us define 
1 
€ = ——— . FI 
2-(n—1) (Ea 
For a feature name f, we assume that f’s values are rational numbers, 
[fmin, fmax] is the range of allowed values in the concrete domain D for values 
f may take. We assume that there is some natural number p, such that for 
any feature name f, the smallest positive value f may take is equal or greater 
than 107?. With pe we define the value 


pe = 10-@ + , (F.2) 


We recall that any [0, 1]-valued variable x has to take values in Ly, t.e., 


il n— 2 
0 Eiza 1 
x E n—1? Fs 
which can be encoded as 
z=(n-1).x 


2€(0,1,...,n—1) (F.3) 


for a new integer variable z. 
The P(D) rules to be considered in the E-ORFuzzySat procedure (Sec- 
tion 8.2.2.2) are as follows. 


var). For variable x¿ occurring in Cx, set 
$ g 
Cr := Ck U {zg € [0,1], z = (n — 1) - 2,2 € {0,1,... n — 1}}, 


where z is a new integer variable. 
(var). For variable x4 occurring in Ck add x = 1 — og to Cx. 
(L). If 0 € Tx then Ck := Ce U {a9 = 0). 
(T). If 1 € Tx then Ck := Ce U {a1 =1). 
(A) 
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1. add ¢ and y to Tx 
2. Ck := Ck U {rg S Ty = Lonrwt- 
(V). IE VY Tr, then 
1. add ¢ and y to Tx 
2. Ck := Cp U {£4 D Ty = Tony} 
(>). If > y € Tk, then 
1. add nnf(=4) and y to Tx 
2. Ck := Cre U{(1- TN, > Ly = A 
(>). If $3 4 € Tr, then 
1. add nnf(=4) and y to Tx 
2. Ck :=Cxk U{(1— at > Ly = Tep). 
(>). If (f >m) € Tx, then 
Ck =CkUL 272 fmin (l-E >.)) FM TF >m) 
ap < (M= pe) (1— E >m)) + fmax ES >m)» 
Lf >m) € {0,1}, Ef E [fmin> fmax]} - 
(— >). Ifa(f >m) € Tx, then add (f >m) to Tk and set 
Ck = CKUITF >m) SL TU 2m)) - 
(<). If (f <m) € Tx, then 
Ck =CkU{ af S fmax: (l= Uf <m)) EM: LF <m) 
Tp > (M+ pe) (L= LF <m)) + fin Uf <m) 
Uf <m) € {0,1}, xf € iaa . 
(~<). Ifa(f <m) € Tx, then add (f <m) to Tk and set 
Ck = CK ULE <m) SL 2 <m)) > 
(=). If (f =m) € Tx, then 


Cr =CkU{ te > fmin: (oy ay) HM: Lf =m) 


Tf < fmax * (Ets y) +m> TIF =m)? 
Tf < max * (1 — yı) + sen — pe): Yı, 
Tf > fmin i (1 + ya) + (m + De) ` Y2, 


TIF =m) T Y1 + Ya = 1, 
L(f =m) € (0, 1), yi € (0, 1}, £f € | fain fmax| } 5 


where y; are new variables. 


OR-base 
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(= =). Ifa(f =m) € Tx, then add (f =m) to Tk and set 


CR = Ce UL@(f =m) SL Uf =m)) - 
(ls). If (f Is(a,b)) € Tic, then 


CK =CkU{ TfT (fmax — a) -Y1 S fmax, 


Uf 
Tf 


ls(a,b)) 2 Yu 


T (fmin a a) “Ya 2 fin; 


Tf + (fmax — 6) Y2 < fmax, 
Tf + ZF ls(a,b)) * (b— a) > b- (b — fmin): (1 — ya), 
Tf + Ef ls(a,b)) * (b— a) < b— (a — fmax): (1 — y2), 
TfT (fmin — b) > Us 2 fmin, 


Uf 


Is(a,b)) £ 1 — ys, 


y +yo+ys=1l, 
Yi € (0, 1), 27 € [fmin; fax] > 


where y; are new variables. 


Els). If =(f Is(a,b)) € Tk 
Ck = 


„then add (f Is(a,b)) to Tk and set 


Ck U {exe Is(a,b)) = 1— Xf ls(a,b)) } : 


(rs). If (f rs(a,b)) € Tx, then 


Ck =CkU{ zf + 


TfT 
TfT 


07 + E(f rs(a,b)) * (b — a) > —a 
Ef + T(S rs(a,b)) * (b= a) <a 


cpt 


Uf rs 


Xf rs 


(fmax — a) -y1 < fmax, 
(a,b)) S1- Y, 

(fmin — a) de È fmin, 
(fmax — b) Ya < fmax 


— 
a 


fmax): (1 — ya), 
Fin) i (1 gi y2), 


= 
= 


(fmin SE b) * Y3 2 Tei 
(a,b)) > Y3, 


yı +y +yz = 1, 
Yi E {0, Ll}, ae € [fmin> fmax| y > 


where y; are new vari 


ables. 


(ars). If a(f rs(a,b)) € Tk, then add (f rs(a,b)) to Tk and set 


Ck = CKU cane: rs(a,b)) = 1- Uf rs(a,b))} . 
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(tri). If (f tri(a,b,c)) € Tk, then 


Ck =CeU{ xs +(fmax — a): yı < fmax, 
Lf tri(ab,c)) £ 1 — Y1, 
£f + (fmin — 4): Y2 > fmin, 
xf + (fmax — D) y2 < fmax, 
pF LIS tri(a,b,c)) * (b — a) 
Daba) (b — a) 
xf + (fmin = b): Y3 > fmin, 
£f + (fmax — C) Y3 < fmax, 
Uf TĂU tri(a,b,c)) * (c= b) > c- (c — fmin): (1 — y3), 
Tf + Xf tri(a,b,e)) *(€— 6) < c= (b — fmax) * (1 — ys), 
£f + (fmin ~ c): Y4 È fmin, 
Lf tri(a,b,c)) £ 1 — Ya, 
Yı +yz + Ya + ya =1, 
Yi € (0, 1),27 € asal) > 


a (a fmax): (1 — y2), 
a (b fmin): (1 — yo), 


2 
< 


a 
a 


where y; are new variables. 
(atri). If =(f tri(a,b,c)) € Tx, then add (f tri(a,b,c)) to Tk and set 


Ck = Cru ¡E tri(a,b,c)) = 1.= Tf ay) : 
(trz). If (f trz(a,b,c, d)) € Tk, then 
Ck =CKU { cpt (ma = a) -Yı < fmax, 
Xf trz(a,b,c,d)) < 1- Yi, 


Tf e (fmin a a) : Ya 2 dining 
TfT (faz ri b) -Y2 S max, 


—x5 + UF trz(a,b,c,d)) * (b — a) > —a + (a — fmax) * (1 — y2), 
Tf TIC trz(a,b,c,d)) * (b— a) < —a + (b — fmin) - (1 — ya), 
£f + (fmin — b) - Y3 > fmin, 

xf + (fmax — C) Y3 < fmax, 

Lf trz(a,b,c,d)) > Y3, 

£f + (fmin = 0) Y4 È fmin, 

xf + (fmax — d): Ya < fmax, 

Tf +L, tre(a,b,c,a)) * (d — c) > d— (d— fain) * (1— ya), 

Tf +27 trz(a,b,c,d)) * (d — 0) < d — (c — fmax) + (1 — ya), 
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xf + (fmin — d): Ys 2 fmin, 
Lf trz(a,b,c,d)) £ 1 — Ys, 

Yı + Y2 + Y3 + Ya +Yys =1, 

yi € {0,1}, £f € [fmin, fmax]} , 


where y; are new variables. 


(=trz). If a(f trz(a,b,c, d)) € Tk, then add (f trz(a,b,c,d)) to Tk and set 


Ck = CKU {rf trz(a,b,c,d)) = 1. — Tf trz(a,b,c,d))} ; 


(w). If w- pe Tx, then 

1. add ¢ to Tk 

2. Cr := Cp U (tog = W: £o}. 
(~w). If =(w- 0) € Tk, then 

1. add nnf(=0) to Tx 

2. Ck := Ce U {w.p = W: xo}. 
(ws). If wi -Yı +... + we: Yk € Tk, then 

1. add all y; to Tic 

2. CK = CRU toto = Do; Wi Tap). 
(nws). If —(u -Yı +... +wp: Yk) € Tx, then 

1. add all =p, to Tk 

2. Cpe = Ce UAB a tpi tte any = 1 DE Ep 
(Im). If lm/(a,b)(¢) € Tx, then add ¢ to Tx and set 


Ck = CeU{ t¿-y<a, 
Lp Za-y, 
rg + a/b-y > afb: Lim(a,b)(4) 
Lo — YS afb: Lim(a,b)(6)) 
(a — 1): Lim(a,b)(6) + (1 — b) - £o < 
(a — 1): Zim(a,b)(¢) + (1— 8) - £4 4 
y € {0,1}} 
where y is a new variable. 
(Aim). If alm(a, b)(@) € Tic, then 
1. add Im(a, b)(¢) to Tk 


2. Cr := Cx U {2 1m(a,b)(6) = 1 — Zim(a bH) } - 
(owa). If QR" (v1,...,Ve) € Tic, then add all y; to Tx and set 
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k 

Ck :=Cx Uta...) = ) wi Yi) U permas- Tp), 
i=1 

where W = (w1,..., Wk) and perm(a1,...,2%) is the set of constraints 


(N = {1,...,k} and i,j € N) 


perm(z1,...;£k) ={ Yr > asc Uh 2 Yk, 
Yi S Lj + 2245, 


Tj < Yi + 2245, 


k 
X zi =k— 1, 
j=1 

k 

yoz =k— 1, 
i=1 


Yi € [0, 1], zij € (0, 1) } > 
where y;, zij are new variables. 
(—owa). If Q572(p1,...,Yx) € Tx, then 
1. add OQ *(Y1,..., Yk) to Tk 
2. Ck := Ck U {Tawa (y, ba) > 1 — TAQWA (ap...) F 
(qowa). If Owe 1, ...;Wk) € Tx, then add all y; to Tk and set 


Ck :=Ck U (Tra (pr...) = y Wi: Yi} U permis: Ep), 
i 


where Wg = (w1,..., wg) is such that 


1 1-1 


wi =E) -QE 


and perm(zx1ı,..., £p) is as for the (owa) rule. 
(—qowa). If “OWE (d1,.--, Pe) € Tx, then 
1. add QWE (1, ---, Ur) to Te 


2. Cx :=Cx U {2-agwa (a Pr) T 1— TAWA (apy, 44) F- 


Some comments are in place. 


Remark 49 (hard datatype rules). For, e.g., the (>) rule, note that strict 
inequalities are not allowed, and hence, the inequality xp < m is expressed as 
xj <m-— pe, for a sufficiently small pe € (0, 1]. 
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Remark 50 ((ws), (ws) rules). For the (ws) and (ws) rules note the 
following: 


e concerning (ws), w1Y1+...+w-Vg is equivalent to wi -Y1V...VWx Vr; 


e concerning (ws), (w1 : Yı +... + wk: We) îs the same as 01 - Y A 
.. Awk * Yk, and thus, we would 


1. add all Wi * Ci to Tk; 
2. set Ck := CxkUfmax(0, O; Lawi) —(n—1)) E Dar b1+ +00 De) 
The above rules can then further be simplified by exploiting the fact that 


Tips 5 Wi Ly; 


II 


Capi L= ip 


and applying the weighted sum rule. 
Remark 51 ((lm) rule). For the (lm) rule note the following: 
e if To <a the Tim(ab)($) = a` Po; 


b 
e if re > a the Lim(a,b)($) = 2 -Lot (1 bt). 


Remark 52 ((owa) rule). The (owa) rule has been obtained directly from the 
encoding for Fuzzy DLs with aggregation operators [58], which we recap here. 


Yager [467] has shown that the maximization of Oy *(21,..., Tn) can in- 
deed be encoded as a MILP problem. However, this encoding does not work 
în our setting as we are not maximizing OF? (x1,..., £k), but rather need to 
show that the set 

NOS) D (za, pisa ¡Da O (1, ah ols Tk) = x} 


is MILP representable. To this end, we need a different encoding. 
So, let N = {1,..., k}. Similarly to [467], we introduce new [0, 1]-valued 
variables y; (i € N) and impose 


YZ Y2... Yk. (F.4) 


The intuition is that y; will take the value of the i-th largest of the xj. Hence, by 
definition of OWA, y; has weight w; and therefore, we add also the constraint 


Wi i m (F.5) 


The remaining of the encoding concerns are now to establish which one among 
T1,..., Tp are the yi. Consider the following set of equations: for i, j € N, new 
binary variables zij 


Yi S Tj + 22ij Tj < Yi + 224; 
(F.6) 
ij == 1 zi =k=1 zig E€ {0,1} . 
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It can be verified that (i) if zij = 0 then y; = x; is imposed; (îi) for any yi, 
there is only one x; imposed to be equal to yi; and (iii) for any xj, there is 
only one y; imposed to be equal to xj. That is, there is a bijection among the 
yj and the zi, which together with Eq. (E.6) guarantees that the equations 
(E.6) - (F.6) correctly encode Ox? (x1, ..., £k) = 1. 


Appendix G 


Fuzzy SPARQL: a Query Language for 
Fuzzy RDFS Graphs 


The aim of this appendix is to introduce a fuzzy SPARQL, a fuzzy extension 
of SPARQL that is compliant with the crisp variant described in Section 6.3. 


G.1 Fuzzy SPARQL 


Fuzzy SPARQL [476, 264] extends SPARQL allowing to query fuzzy RDFS 
graphs. 

A Fuzzy SPARQL has its counterpart in a generalization of fuzzy conjunc- 
tive queries, which are of the general form 


q(x,A,a,z)  3y3A px A, y, A”), 
GroupedBy(w), 
a:=0[f(2)l, 
OrderBy(2) . 


The semantics is a straightforward extension of the one that has been 
defined in Section 9.3 (see also [476]). 

Now, the Fuzzy SPARQL query languages are defined as follows [476]. A 
simple fuzzy SPARQL query is defined — analogously to a SPARQL query — 
as a triple Q = (P, G, V, A) with the differences that 


e Gisan fuzzy RDF graph; 
e we allow fuzzy graph patterns and 


e Ais the set of fuzzy variables taken from an infinite set A (distinct from 
V). 


We further denote by fvar(P) the set of fuzzy variables present in a graph 
pattern P. 

Let A be a fuzzy value from [0, 1] or an fuzzy variable from A. We call A an 
fuzzy label. Triple patterns in fuzzy SPARQL are defined the same way as in 
SPARQL. For a triple pattern 7, we call (7, A) an fuzzy triple pattern and sets 
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of fuzzy triple patterns are called basic fuzzy patterns (BFP). A generic fuzzy 
graph pattern is defined in a recursive manner: any BFP is a fuzzy graph pat- 
tern; if P and P’ are fuzzy graph patterns, R is a filter expression (see [381]), 
then (P AND P”), (P OPTIONAL P”), (P UNION P”), (P FILTER R) are fuzzy 
graph patterns. 


Example 62. Suppose we are looking for rich people who own a sports car. 
This query can be posed as follows: 


SELECT ?p ?11 ?c WHERE 4 
<(?p type RichPerson), ?11> 
OPTIONAL{<(?p hasSportsCar ?c), ?12>) 
} 


Assuming our example dataset has the following triples: 


((toivo, type, Rich Person), 0.8) 
((toivo, hasSportsCar, ferrari), 1.0) 
((toivo, hasSportsCar, audiTT), 0.7) 


we will get the following answers: 


0, = {?p/toivo, ?11/0.8} 
02 = {?p/toivo, ?11/0.8, ?c/ ferrari} 
03 = {?p/toivo, ?12/0.8,?c/audiTT} . 


The first answer corresponds to the answer in which the OPTIONAL pattern 
is not satisfied. In the second and third answers, the OPTIONAL pattern is 
also matched. 


From a semantics point of view, the semantics of fuzzy SPARQL queries extend 
the notion of SPARQL BGP matching. As for the SPARQL query language, 
we are going to define the notion of solutions for BFP as the equivalent notion 
of answers set of fuzzy conjunctive queries. Just as matching BGPs against 
RDF graphs is at the core of SPARQL semantics, matching BFPs against 
fuzzy RDF graphs is the heart of the evaluation semantics of fuzzy SPARQL. 

We extend the notion of substitution to include a substitution of fuzzy 
variables in which we do not allow any assignment of a fuzzy variable to 0. A 
value of 0, although it is a valid answer for any triple, does not provide any 
additional information and thus is of minor interest. Furthermore this would 
contribute to increasing the number of answers unnecessarily. 

Let P be a BFP and G an fuzzy RDF graph. We define evaluation [Ple as 
the list of substitutions that are solutions of P, i.e., [Pla = {0 | G = 0(P)), 
and where G | 0(P) means that any fuzzy triple in 0(P) is entailed by G. 

As for SPARQL, we have: 


Proposition 98. Given a fuzzy graph G and a BFP P, the solutions of P 
are the same as the answers of the fuzzy query q(var(P)) + P (where var(P) 
is the vector of variables in P), i.e., ans(G, q) = [Ple. 
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For the extension of the SPARQL relational algebra to the fuzzy case we 
introduce — inspired by the definitions in [348] — definitions of compatibility 
and union of substitutions: 

Two substitutions 6; and 02 are Q-compatible if and only if (i) 01 and 
02 are compatible for all the non-fuzzy variables, i.e., 01(x) = 02(x) for any 
non-fuzzy variable x € dom(9,) N dom(02); and (îi) 01 (A) 8 02(A) 4 0 for any 
fuzzy variable A € dom(9,) N dom(03). 

Given two S-compatible substitutions 0, and 02, the Q-union of 0, and 
02, denoted 01 Q 02, is as 01 U 02, with the exception that any fuzzy variable 
A € dom(01) N dom(02) is mapped to 01(A) Y 92(A). 

We now present the notion of evaluation for generic fuzzy SPARQL graph 
patterns. Let P be a BFP, P, P fuzzy graph patterns, G an fuzzy graph 
and R a filter expression, then the evaluation [+]a, i.e., set of answers,! is 
recursively defined as: 


e [Pla = 10 | dom(0) = var(P) and G H 0(P)) 


e |P, AND Pala = (01 ® b2 | 01 € [Pila, 02 € [PoJa, 01 and 02 @-compatible} 
o Pı UNION Pola = [Pila U [Pala 
+ [P, FILTER Re = {0 | 6 € [Pile and RO is true) 


e [Pi OPTIONAL P2|R]]e = 
10 | and 6 meets one of the following conditions: 


1. 0= 0, @ 62 if 8, € [Pi], 92 € [Pala, 01 and 028-compatible, and 
RO is true; 

2. 0=01 € [Pila and ve € [P2]e such that 01 and 02 Q-compatible, 
R(01802) is true, and for all fuzzy variables A € dom(0,)Ndom(62), 
b(A) =<01(A); 

3. 0=01 € [Pila and ve € [Po] such that 01 and 02 Q-compatible, 
R(01 Q 02) is false }. 

Let R be a FILTER expression and x,y € AU L, in addition to the FILTER 


expressions, we further allow the expressions presented next. The valuation of 
R on a substitution 6, denoted RO is true if:? 


e R=(< y) with x,y € dom(0) ULA 0O(x) < 0(y); 


e R= p(z) with p(z)@ = true if and only if p(0(z)) = true, where p is a 
built-in predicate; 


1 Strictly speaking, we consider sequences of answers — note that SPARQL allows du- 
plicates and imposes an order on solutions, see below for more discussion — but we stick 
with set notation representation here for illustration. Whenever we mean “real” sets where 
duplicates are removed we write {...}pistincT- 

2We consider a simple evaluation of filter expressions where the “error” result is ignored, 
see [381, Section 11.3] for details. 
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e Otherwise RO is false. 


Remark 53. For practical convenience, we retain in [-]a only “domain maz- 
imal answers.” That is, let us define 0” < 0 if and only if (i) 0 4 0; (it) 
dom(0) = dom(0'); (iii) O(a) = 0'(x) for any non-fuzzy variable x; and (iv) 
0) < O(A) for any fuzzy variable A. Then, for any 0 € [P]a we remove any 
0 € [Pla such that 0' < 6. 


The following proposition shows that we have a conservative extension of 
SPARQL: 


Proposition 99 ({476, 264]). Let Q = (P,G,V) be a SPARQL query over an 
RDF graph G. Let G' be obtained from G by annotating triples with 1.0. Then 
[Pla under SPARQL semantics is in one-to-one correspondence to [Pla 
under fuzzy SPARQL semantics such that for any 0 € [Pla there is a 0 € 
[Ple with 0 and 0’ coinciding on var(P). 


Next, we will present extensions to include variable assignments, aggre- 
gates, and solution modifiers. These extensions are similar to the ones pre- 
sented in Section 6.3. 

Let P be a fuzzy graph pattern and G a fuzzy graph, the evaluation of an 
ASSIGN statement is defined as: 


[P ASSIGN f(z) AS zla = {0 | 61 € [Pla, 0 = 01[2/£(01 (2))]) 


where 
_ f ou {z/t} if z 2 dom(0) 
0[2/t] = { (0 \ {z/t'}) U{z/t} otherwise . 


Essentially, we assign to the variable z the value f(@1(z)), which is the evalu- 
ation of the function f(z) with respect to a substitution 01 € [P]c. 


Example 63 ([476]). Using a built-in function we submit a query such as: 


SELECT ?p ?c ?z WHERE { 
<(?p type RichPerson), ?11> 
<(?p hasSportsCar ?c), ?12> 

ASSIGN ?11 * 712 AS ?z 

} 


Fuzzy SPARQL also supports the ORDERBY clause where the evaluation of 
a |P ORDERBY ?z]c statement is defined as the ordering of the solutions — 
for any 0 € [Pla — according to the values of 0(?x). Ordering for non-fuzzy 
variables follows the rules in [381, Section 9.1]. 

In case the variable x is a fuzzy variable, the order is induced by <. In case 
the order is some partial order then we may use some linearization method 
for posets, such as [248]. Likewise, the SQL-like statement LIMIT(&) can be 
added straightforwardly. 
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We can further extend the evaluation of fuzzy SPARQL queries with ag- 
gregate functions 


Q e (SUM, AVG, MAX, MIN, COUNT, 9, 9) 


as follows: 
The evaluation of a GROUPBY statement is defined as:? 


[P GROUPBY(w) Gf(z) AS ala = {0 | 0, in [Pla, 
0 = O;|wlai/@ fi(O:(Z))] }oistinct 


where the variables a; ¢ var(P), z; € var(P) and none of the GROUPBY vari- 
ables w are included in the aggregation function variables z;. Here, we denote 
by |w the restriction of variables in 8 to variables in w. Using this notation, we 
can also straightforwardly introduce projection, i.e., sub-SELECTs as an alge- 
braic operator in the language covering another new feature of SPARQL 1.1: 


[SELECT V (Pia = (0101 in [Ple,0=011w) . 


Remark 54. Please note that the aggregator functions have a domain of 
definition and thus can only be applied to values of their respective domain. For 
example, SUM and AVG can only be used on numeric values, while MAX, MIN 
are applicable to any total order. Resolution of type mismatches for aggregates 
is currently being defined in SPARQL 1.1 [380] and we aim to follow those, 
as soon as the language is stable. The COUNT aggregator can be used for any 
finite set of values. The last two aggregation functions, namely Y and 8, are 
defined by the fuzzy domain [0,1] and thus can be used on any fuzzy variable. 


Example 64. Suppose we want to know, for each person, the average score 
of being rich and having a sports car. Then such a query will be expressed as: 


SELECT ?x ?avgS WHERE 4 
<(?p type RichPerson), ?11> 
<(?p hasSportsCar ?c), ?12> 
GROUPBY (7p) 
AVG(?11 * 712) AS ?avgL 
} 


Proposition 100. Assuming the built-in predicates are computable in finite 
time, the answer set of any fuzzy SPARQL is finite and can also be computed 
in finite time. 


3In the expression, @f(z) AS a is a concise representation of n aggregations of the form 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


Appendix H 


Tableau Calculi for Fuzzy SHTF, 


As we have seen in Appendix C, the major issue introduced by functional roles 
is that together with inverse roles they may cause a concept to be satisfiable 
in an infinite model only as Example 60 illustrates. Of course, this property 
applies to fuzzy SHTF, as well. 

In order to cope with this issue, like for the crisp DLs case, we use the 
so-called notion of pairwise blocking to guarantee the correct termination of 
a tableau from which then we may build a possibly infinite model. 


To start with, in this appendix, we restrict fuzzy RIAs in a fuzzy 
RBoz R to be of the form RE Ra only. 


Then the definitions of Inv(R), Trans(R) and CR (the transitive-reflexive clo- 
sure of E over R’ = RU(Inv(R¡)Elnv(R2) | RıERz € R}) are as for the crisp 
case (see Appendix C). 


H.1 Analytical Fuzzy Tableau under SFL 


We now extend the analytical fuzzy tableau under SFL seen in Sec- 
tion 10.6.1.2 for fuzzy ALC to fuzzy SHT.F,. So, let K be a SHTF, KB. The 
notions of completion-forest, clash, and the initialization of a completion-forest 
is like Section 10.6.1.2. 

If nodes v and w are connected by an edge (v, w) then w is called a successor 
of v and v is called a predecessor of w. Ancestor is the transitive closure of 
predecessor, and descendant is the transitive closure of successor. 

If nodes v and w are connected by an edge (v, w) with (R’,n) € L((v, w)) 
and R’ CR R, then w is called an R,-successor of v and v is called an Rn- 
predecessor of w. If node w is an R,-successor of v or an Inv(R)„-predecessor 
of v, then w is called an R,-neighbor of v. 

Similarly to the crisp case, we also need a technical definition involving 
functional roles. Let F be forest, R a functional role such that we have two 
edges (v,w 1) and (v, wa) such that (R,n,) occurs in £((v,w1)) and (R,n2) 
occurs £((v, wa)), respectively. Then we call such a pair a fork. As R is func- 
tional, such a fork means that w, and wa have to be interpreted as the same 
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individual. Such a fork can be deleted by adding both £L((v, w2)) to L((v, w1)), 
and L(w2) to L(w1), and then deleting node w2. Of course, as inverse roles 
are allowed these may also contribute to create a fork, î.e., we have a fork if 
w, and wa are R-neighbors of v. 

We assume that forks are eliminated as soon as they appear (as part of a 
rule application) with the proviso that newly generated nodes are replaced by 
older ones and not vice versa. 

At the beginning, we check that there are no forks in the initial forest, 
otherwise the KB is not satisfiable (due to the unique name assumption, if 
we have ((a,bi):R,n1) and ((a,b2):R,n2) with bı 4 b2 then R cannot be 
functional). 

The notion of pairwise blocking, blocked node, directly blocked, and indi- 
rectly blocked is as for crisp SHTF, (see Appendix C). 

The set of reasoning rules are shown in Table H.1, where e is given by 
Equation (10.13): 


TABLE H.1: The tableau rules for fuzzy SHTF,. 


(N). If (i) (C1 Ca2,n) € Llv), (îi) [(C1,n),(Ca,n)+ Z L(v) and (iii) node v 

is not indirectly blocked, then add (C¡,n) and (C2,n) to L(v). 

(U). If (i) (C1 UC2,n) € L(o), (îi) {(Ci, n), (C2, n) OL(v) = f and (iii) node 

v is not indirectly blocked, then add some (C,n) € ((C1,n), (C2,n)} to 

L(v). 

(V). If (i) (WR.C,n) € L(v), (îi) there is an R„-neishbour w of v with m > 

1—n, (iii) (Cin) € L(w), and (iv) node v is not indirectly blocked, then 

add (C, n) to L(w). 

(V+). If (i) (VS.C,n) € L(v), (ii) there is some R with Trans(R) and R E} S, 

(iii) there is an Rm-neighbour w of v with m > 1 — n, (iv) (VR. C, g 

L(w) and (iii) and node v is not indirectly blocked, then add (WR.C, n) 
to L(w). 

(3). If (i) AR.C,n) € L(v), (ii) there is no R,,-neighbour w of v with 
(C, n2) € L(w) such that min(n1, n2) > n, and (iii) node v is not 
blocked, then create a new node w, add (R,n) to L((v,w)) and add 
(C,n) to L(w). 

(E). If i) (TE Dyn) €T, (ii) (D,n) g L(v) and (îi) node v is not indirectly 

blocked, then add (D,n) to L(v). 

(Č). If (i) CED €T, (îi) for some n € NE, {(nnf(-C),1—n+.),(D,n)}N 

L(v) = 0 and (iii) node v is not indirectly blocked, then add E to L(v) 

for some E € {(nnf(7C),1—n+ 6), (D,n)}. 


It can be shown in a similar way as in [389] that: 
Proposition 101. For each SHTF, knowledge base K 
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1. the tableau algorithm terminates; 


2. if the expansion rules can be applied in such a way that they yield a 
complete and clash-free completion forest, then K has a model; 


3. if K has a model, then the expansion rules can be applied in such a way 
that they yield complete and clash-free completion forest for K. 


H.2 Fuzzy Tableau for Finite-Valued DLs 


We shortly show how one may extend the tableau calculus illustrated in 
Section 10.6.1.3 for ALC to SHTF, as well. 

So, let K be a SHZF, KB. The notions of completion-forest, successor, 
predecessor, ancestor, and neighbor and the management of forks are as in 
Section H.1. The notions of clash and the initialization of a completion-forest 
is like Section 10.6.1.3. The notion of pairwise blocking is as for the crisp case. 

The set of reasoning rules are shown in Table H.2: 

We have that 


Proposition 102 ([73]). For each SHTF, knowledge base K with a finitely 
valued truth space Ln 


1. the tableau algorithm terminates; 


2. if the expansion rules can be applied in such a way that they yield a 
complete and clash-free completion forest, then K has a model; 


3. if K has a model, then the expansion rules can be applied in such a way 
that they yield complete and clash-free completion forest for K. 


Of course, the method can be adapted to any truth space as well, as long as 
it is finite. 


H.3 Operational Research-based Fuzzy Tableau under 
SFL 


We now extend the OR-based fuzzy tableau under SFL seen in Sec- 
tion 10.6.1.4 for fuzzy ALC to fuzzy SHTF,. So, let K be a SHTF, KB. The 
notions of completion-forest, R-successor, R-predecessor, R-neighbor, succes- 
sor, predecessor, neighbor, and ancestor are as for the crisp case (see Ap- 
pendix C). As for ALC, a forest has associated a set CF of mixed-integer 
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TABLE H.2: The tableau rules for finitely valued fuzzy SHTF,. 


. If (i) (C1 NC2,m) € Liv), (ii) there are m,,ma € Ln such that m. ® 


ma = m with ((C¡,m1),(C2,ma)+ Z L(v), and (iii) node v is not 
indirectly blocked, then add (C1,m1) and (C2, M2) to L(v). 


. If (i) (C1 UC2,m) € Liv), (ii) there are m,,ma € Ln such that mu O 


ma = m with {(C1, m1), (C2, M2)} N L(v) = Ø, and (iii) node v is not 
indirectly blocked, then add some (C, k} € {(C1, m), (C2, ma) to L(v). 


„If (i) (5C,m) € L(v) with (C,em) g L(v) and (îi) node v is not indi- 


rectly blocked, then add (C, em) to £(v). 


. If (i) (Ci >C2,m) € Liv), (ii) there are m,,ma € Ln such that 


mi => mo = m and {(C1, ma), (0Co,mo)) Z L(v), and (iii) node v is 
not indirectly blocked, then add (C1, ma) and (C2, ma) to L(v). 


. If (i) WR.C,m) € L(v), (ii) (BR, mu) € L((v, w)), (iii) there is ma € Ln 


such that mı > ma > m with (C,ma) Z L(w), and (iv) node v is not 
indirectly blocked, then add (C,ma) to £(w). 


. If (i) (AR.C,m) € L(v), (ii) there are m1, ma € Ln such that mi 9 ma = 


m, (iii) there is no (R, mı) € L((v,w)) with (C,ma) € L(w), and (iv) 
node v is not blocked, then create a new node w, add (R, ma) to L((v, w)) 
and add (C, m2) to L(w). 


. If (i) (3R.C,m) € Liv), (îi) (R,m1) € L((v, w)), (iii) there is ma € Ln 


such that ma 9 ma < m with (C,ma) € L(w), and (iv) node v is not 
indirectly blocked, then add (C, ma) to £(w). 


. If (i) (WVR.C,m) € L(v), (îi) there are m,,ma € Ln such that mı => 


ma = m, (iii) there is no (R, mı} € L((v, w)) with (C,ma) € L(w), and 
(iv) node v is not blocked, then create a new node w, add (R,m) to 
L((v, w)) and add (C,ma) to L(w). 


. If (i) VS.C,m) € L(v), (ii) there is some R with Trans(R) and R E} S, 


(iii) there is an R,,, -neighbour w of v and ma € Ln such that mı > 
ma > m (iv) YR.C, m2) € L(w), and (iv) and node v is not indirectly 
blocked, then add (VR.C, ma) to L(w). 


. If i) (CE D,m) €T, (îi) there are m1, ma € Ln such that mi > ma > 


m, (iii) ((C,m1),(D,ma2)) Z L(v), and (iv) node v is not indirectly 
blocked, then add (C, mı) and (D, m2) to L(v). 


. If (i) CED € T, (ii) there are mı, m € Ly such that mı < ma, (iii) 


[(C,m1),(D,ma)+ Z L(v), and (iv) node v is not indirectly blocked, 
then add (C, mı) and (D, ma) to L(v). 
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linear programming constraints. The initialization of a completion-forest is 
like for ALC (see Section 10.6.1.4), too. 

We assume that forks are eliminated as soon as they appear (as part of 
a rule application) with the proviso that newly generated nodes are replaced 
by older ones and not vice versa. At the beginning, we check that there are 
no forks in the initial forest, otherwise the KB is not satisfiable (due to the 
unique name assumption, if we have ((a,b,):R,n,) and ((a,b3):R,n2) with 
bı Æ ba then R cannot be functional). 

The notion of pairwise blocking, blocked node, directly blocked, and indi- 
rectly blocked is as for crisp SHTF, (see Appendix C). 

The set of reasoning rules are shown in Table H.3: 

Again, it can be shown that using the rules in Table H.3 we have that: 


Proposition 103. For each knowledge base K = (T, A) 
1. the tableau algorithm terminates; 


2. if the expansion rules can be applied to a knowledge base K = (T, A) 
such that they yield a complete completion forest F such that Cr has a 
solution, then K has a (witnessed) model; 


3. if a knowledge base K = (T, A) has a (witnessed) model, then the appli- 
cation of the expansion rules yields a complete completion-forest for K 
such that CF has a solution. 


H.4 Reasoning with Fuzzy Concrete Domains and Ag- 
gregation Operators under SFL 


Fuzzy Concrete Domains. [398] first showed a general OR-based method 
to deal with concrete domains. Specifically, for a functional datatype property 
a fork is eliminated as soon as it appears, in the same way as for an object 
property (i.e., role). For a datatype property T, we assume that T’s values 
are rational numbers, [Tmin, Tmax] is the range of allowed values that T-fillers 
may take. The OR-based inference rules related to concept expressions of the 
form 


CD > VId|3T.d 


are the datatype property analogue of the (VY) and (3) rule in Table H.3, i.e., 


(Va). If (i) YT.d € L(v), there is a T-neighbor w of v, (îi) the rule has not been 
already applied to this concept, and (iii) node v is not indirectly blocked, 
then add d to L(w), and Cf := CF U [Lu:d > LyNT.a Q Tluw):T> Tan SS 
Ly S Tmax}. 
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TABLE H.3: The OR-based tableau rules for fuzzy SHZF, with GCIs. 


(var). For variable t,.c occurring in CF add zu:c € [0,1] to Cf. For variable 
Tw w):r occurring in Cz add £w w):r € [0,1] to Cr. 


(L). If L € L(v) then CF := CF U {£v =0). 

(T). ET € L(v) then CF := CFU {2y:7 = 1}. 

(A). If nA € L(v) then add A to L(v), and Cr := Cr Uf{ay.4 < 1 — Luna). 
(TM). If (i) C1 Ca € L(v), (ii) the rule has not been already applied to this 


concept, and (iii) node v is not indirectly blocked, then add Cı and Ca 
to L(v), and CF := CF U {£v:0; Q Lv:C, > Lv:0,ncp }- 

(U). If (i) Ci U C2 € L(v) and (îi) the rule has not been already applied to 
this concept, and (iii) node v is not indirectly blocked, then add Cı and 
Cz to L(v), and CF := CF U {tu:c, E £o:03 > Lv:Cyucy}- 

(Y). If (4) VR.C € L(v), there is a R-neighbour w of v and (îi) the rule has not 
been already applied to this concept, and (iii) node v is not indirectly 
blocked, then add C to £(w), and CF := CrU{tu:c > Ly:vR.C@L(v,w):R}- 

(Vi). If (i) VS.C € L(v), (ii) there is some R with Trans(R), R CR S and 
R € L((v,w)) and (iii) the rule has not been already applied to this 
concept, and (iv) node v is not indirectly blocked, then add VR.C to 
L(w), and CF := CF U {£w:yR.C > LuvR.c 8 L(y,w):R}- 

(3). If (i) 3R.C € L(v) and (ii) the rule has not been already applied to this 
concept and (iii) node v is not blocked then create a new node w, add R 
to L((v, w)), add C to £(w), and Cf := CFU{ £ww): RI Tw:C > Lv:3R.C)- 

(E). If (i) (TED nm) €T, (îi) vis a node to which this rule has not yet been 

applied, and (iii) node v is not indirectly blocked, then add D to L(v) 

and set CF := CF U {£v:D > n}. 

(E). If (i) CED e T, (îi) v is a node to which this rule has not yet been 
applied, and (iii) node v is not indirectly blocked, then add nn f(=C') 
and D to £(v) and set CF := CF U {£v:nnf( 00) Di Lv:p > 1}. 


~ 

LL] 
a 

Ww 


. If (i) AT.d € L(v) and (ii) the rule has not been already applied to 
this concept and (iii) node v is not blocked then create a new node w, 
add T to £L((v, w)), add d to L(w), and Cr := CF U {£w w):T 8 Lw:a > 
Tv:IT.d; U(v,w):T € 10, 1}, Tin < fy < Tass 


In the rules above, £w is the variable representing the concrete value of a 
concrete predicate d (e.g., see the case of ls below). 

In the following we assume that a function L is MILP representable (see 
Section 8.2.2.2) and for L we consider the graph g(L) = {(x1, £2) € [0,1] x 
[0,1] | L(x,) = x2}. The general inference rules to deal with concrete domains 
are 
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(d). If d € L(w) and the rule has not been already applied to this concept 
then Cf = Cr U y(w,d), where the set y(w, d)) is obtained from the 
bMILP representation of g(d) by replacing all occurrences of x2 with 
Lw:a and x1 with ty. 


(~d). If sd € L(w) and the rule has not been already applied to this concept 
then add d to £(w) and set CF = CF U [Lua = 1 — Lua). 


For instance, the inference rule for Is(a,b) € L(w), where w is a T-successor 
of v, is obtained immediately from the (ls) rule for P(D) (Appendix F) in 
which we replace 


e fmax and fmin with Tmax and Tmin, respectively; 
e xp with zu; and 
© Tf Is(a,by) With Ly:15(a,b)- 

That is, the rules for ls(a, b) are 


(ls). If Is(a,b) € L(w), the rule has not been already applied to this concept 
and w is a T-successor of v, then 


Cr =CrU{ Tw + (Tmax — a) Y1 < Tmax, 


Tw:ls(a,b) 2y, 


Tw + (Tmin — @) y2 > Tmin, 
Tw + (Tmax = b) : y2 < Tmax, 
Ly + Zu:ls(a,b) * (b — a) > b — (b — Tmin) (1 — yo), 
Ly T Tw:ls(a,b) * (b — a) <b— (a — Tmax) * (1 — y2), 
Tw + (Tmin — b) - Y3 2 Tmin, 


Tw:ls(a,b) <1- Y3, 
yı +y2 +yz =1, 
Yi € {0, 1} > 


where y; are new variables. 


(Als). If =1s(a,b) € L(w) and the rule has not been already applied to this 
concept then add Is(a,b) to L(w) and set CF = CF U {£w:is(a,b) = 
1- Cta de 


The rules for d € {rs(a, b), tri(a, b,c), trz(a, b, c, d), rs(a, b), atri(a, b,c) and 
—trz(a, b,c, d)} occurring in £(w), where w is a T-successor of v, are obtained 
analogously from the (d) rule for P(D) (Appendix F) in which we replace 


e fmax and fmin with Tmax and Tmin, respectively; 


e xf with zu; and 
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e ZF a) with Zw:d- 


The inference rules for the cases d € {>m,<m,=m} can be worked out sim- 
ilarly. Indeed, let pe as determined in Appendix F. Then the inference rules 
for d € {>m,<m,=m} occurring in L(w), where w is a T-successor of v, are 
obtained immediately from the (d) rule for P(D) in which we replace 


e fmax and fmin with Tmax and Tmin, respectively; 

e xp with zu; 

o Tf d) with XLw:d- 
Fuzzy Modifiers. We next show how to deal with fuzzy modifiers as de- 
scribed in Section 10.3. So, concept expressions are of the form 


C => m(C) | VT.m(d) | 3T.m(d) 


where we further assume that m is a linear modifier. The rules for Im(C) or 
im(d) occurring in £(w) can again easily be derived from the (Im) rule for 
P(D). Specifically, we have 


(Im). If Im(a,b)(E) € L(w), where E is a concept C or a concrete fuzzy 
predicate d, the rule has not been already applied to this concept and 
if E is a concept then node w is not indirectly blocked, then add X to 
L(w) and set 
Cr :=CFUÍ twE-=YyS<S0, 

Tw:E > a: Y, 

Tw:E + a/b Y 2 a/b ` Lw:lm(a,b)(E)> 

Tw:E — Y < a/b ` Lw:lm(a,b)(E)> 

(a 73 1) ` Lw:lm(a,b)(E) F (1 = b) -Zw:E S (a b) “YH 2(1 E y), 
(a pi 1) * Lw:lm(a,b)(E) E (1 Zu b) ¿La:E + 2(1 vi y) > (a ore b) Y, 
y € {0, 1}} 


where y is a new variable. 


img). If =Im(a,b)(E) € L(w), where E is a concept C or a concrete fuzzy 
predicate d, the rule has not been already applied to this concept and 
if E is a concept then node w is not indirectly blocked, then add Img 
to L(w) and set Cf = CF U {taimien =1— Tw:Im(a,b)(E) }- 


Aggregation Operators. We start by showing how to reason with weighted 
sum concepts, i.e., concept expressions of the form 
wy Ci +: +H Wk: Ck 


where m; € (0,1), X; m; < 1. 
The inference rules are derived immediately from the corresponding one 
for P(D) in which we replace 4; with C; and, thus, get 
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(ws). If wı -C1 +--+ wg: Ck € L(v), the rule has not been already applied 
to this concept, and node v is not indirectly blocked, then 


1. add C1,...,Cg to L(v); 
2. set Cr = Cr U ¡ae Wi Lu:C, = DILO TEA 


(ws). If =(w Cr +--+ kr Cu) € L(v), and the rule has not been already 
applied to this concept, and node v is not indirectly blocked, then 


1. add =C1,..., Cg to L(v); 
2. set CF := CF U {1 — De Wi ` Ly:C; = Rs TE e a eae a 
Remark 55. Note that, by Remark 33, to guarantee decidability in presence 


of the weighted sum operator, we need to restrict the attention to acyclic KBs 
only. 


Eventually, we address how to deal with OWA and the quantifier-guided OWA 
(see Section 8.1.6 and Section 10.3). Not surpassingly, again the inference rules 
can immediately be derived from the (owa) and (qowa) rules for P(D) in which 
we replace p; with C; and, thus, get 


(owa). If ApF*(C1,...,Cx) € L(v), the rule has not been already applied to 
this concept, and node v is not indirectly blocked, then 


1. add C1,...,Cg to Lv); 


2. set 
k 
Cr = CEU {2y.aowa(c,,...c,) = > wi yi} U 
ier 
permíty:0, ; os, ,Ly:C, ) y) 
where W = (w1,...,wg) and perm(x1,..., tk) is the set of con- 


straints (N = {1,...,k} and i,j € N) 


permíx1,..., Ur) ={ Yr >Yo2---,Yh-1 > Yk, 
Yi < Tj T 22445 


Tj < Yi + 2245, 


k 
DE = k= 1, 
j=1 

k 
Si =k-= 1, 
i=1 


yi € [0,1], zi; € {0,1} }, 


where y;, zij are new variables. 
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(us). If Q9*(C1,..., Cp) € L(v), and the rule has not been already ap- 
plied to this concept, and node v is not indirectly blocked, then 


1. add Q? (C1, .--, Ck) to L(v); 


2. set CF := CF U {1 — Ty: QQUA(C),....Ch) = Ty: QQWA(C) ..... Cr) 


dadas 


It is straightforward to see now that indeed also the inference rules for 
quantifier-guided OWA, namely rules (qowa) and (—qowa) for P(D), can be 
adapted to the fuzzy DL case in a similar manner as we did for the OWA. 

Eventually, note that Remark 55 applies to OWA and quantifier-guided 
OWA as well. 
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